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1 nCTECTTf^N OF HOMAN ADEMOVTRnS 

The present invention relates to DNA and proteins of human 
adenovirus type 41 and methods of detection thereof. In particular, 
the present invention relates to the isolation of a 41.4 kd fiber 
5 protein (**short" fiber protein) and a 60.6 kd fiber protein (**long" 
fiber protein) of human adenovirus type 41 (Ad41) , as well as proteins 
derived from the Ad41 E3 region, thereby providing virus -derived 
antigens and active derivatives and parts thereof, useful in the 
development of diagnostic assays, DNA probes and vaccines for said 

10 virus or other related viruses belonging to the htaman enteric adeno- 
virus family. The present invention is further directed to 
recombinant DNA molecules containing the Ad41 long fiber protein 
gene, the Ad41 short fiber protein gene and the Ad41 E3, gene (encoding 
the proteins RL-1 to RL-6) thereby providing a source of recoznbinint 

15 viral coiTOonents useful in the development of said diagnostic assays 
for Ad41. The present invention is also directed to first antibodies 
specific to the above-identified Ad41 viral con5)onents and to second 
antibodies specific to the first antibodies. These second antibodies 
are also useful in the development of diagnostic assays for Ad41 and 

20 other adenoviruses . 

Adenoviaruses are sinple DNA-containing viruses (i.e., 
composed of only DNA and protein) that multiply in the cell nucleus 
of the host. These viruses induce latent or acute infections in 
tonsils, adenoids, lungs, bladder and cornea as well as the 

25 gastrointestinal tract and are readily activated. Several adeno- 
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1 viruses are the first coirmon viruses of humans shown to be oncogenic 
for lower animals under special experimental circurrstances . The 
adenoviruses may serve as ^'helpers" for adeno-associated viruses 
which cannot replicate in their absence. 
5 The viral particles of the adenovirus have a dense central 

core and an outer coat known as the capsid. These particles have an 
icosahedral configuration and are composed of 252 capsomers : 240 
heKons make up the faces and edges of the equilateral triangles and 
12 pentons comprise the vertices . The hexons are truncated triangular 
10 or polygonal prisms with a central hole. The pentons are more conplex, 
consisting of a polygonal base with an attached fiber protein^ whose 
length (i.e.^ short or long) varies with viral type. Minor capsid 
proteins are also associated with the hexons or pentons and confer 
stability on the capsid to form links with the core proteins, ami 
15 to function in virion assembly. 

Each virion contains one linear, double* stranded DNA 
molecule associated with proteins to form the core of the adenovirus . 

The early region 3 (£3) of adenoviruses plays a critical 
role in pathogenesis of the virus' s disease process even though none 
20 of its gene products are essential for replication of the virus in 
cell cultures. Not all proteins coded in the E3 regions of 
adenoviruses have been identified, even for the most conmonly studied 
adenovirus, type 2 {Ad2) . However, it has been postulated that they 
mediate cellular or immunological responses through structural or 
25 functional homology to regulatory molecules. For this reason, it is 
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1 possible that proteins generated from the E3 region, or their 
derivatives f can be used in therapy as modulators of the immune 
response (e.g., as an immunostimulation system in AIDS pa-ients) or 
as anti-cancer agents to modify the action of various growth factors . 
5 In addition^ . specific E3 proteins can be used to distinguish between 
different adenovirus types. 

Adenoviruses are widespread in nature. The 39 accepted 
members of the adenovirus family have similar chemical and physical 
characteristics and a family cross-reactive antigen but are 

10 distinguished by antibodies to their individual type-specific 
antigens: ait least 41 are from humans and the rest from various 
animals . « 

The enteric adenoviruses, such as Adenovirus Type 40 or 
41 (and also known as Type F Enteric Adenoviruses) , are a virus group^ 

15 that cause serious intestinal and diarrheal diseases of young 
children. In 1978, the World Health Organization initiated a program 
for global prevention and control for such childhood diseases. As 
a result, the relative importance of various pathogens in the etiology 
of diarrhea in many parts of the world has been recognized. For 

20 example, rotaviruses, which rank as the most prevalent viral pathogen 
in childhood diarrhea, may now be close to control as many vaccines 
are now in eight. This has been made possible through very intensive 
research over the past decade. 

However, the control of enteric adenoviruses, which are 

25 responsible for at least 15% of all cases of severe infantile 
gastroenteritis, is not yet within reach. Although they are second 
after rotaviruses as viral agents causing this type of infection, 
enteric adenoviruses remain. a poorly defined group of viruses. The 
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1 paucity of research done on enteric adenoviruses is mainly due to 
the difficulty of propagating the viruses in cultures. For this 
reason, there is no sensitive, fast, and diagnostic procedure able 
to distinguish between enteric adenoviruses and other adenoviruses 
5 (Group A, B, C, D, and E) which are commonly present in stools but 
are not agents of gastroenteritis. Another reason for studying 
enteric adenoviruses is their possible link to intestinal cancer 
which appears later in the life of infected individuals. 

The standard reference methods for diagnosis of enteric 
10 adenoviruses have been (1) immunoelectron microscopy; (2) type- 
specific neutralization; (3) growth differences on primary human and 
Graham-293 cells . None of these methods are accurate and suitable 
for rapid routine use. Recently a new commercially available enzyme- 
linked intnunoabsorbent assay (ELISA) to detect enteric adenoviruses 
15 (Adeno-Type 40/41 EIA, Cambridge Bioscience) based on a polyclonal 
antibody to enteric adenovirus hexon protein was created, but this 
kit lacks both specificity and sensitivity. 

However, the present invention solves the problems 
associated with the previous methodologies. The present invention 
20 describes a recombinant DNA molecule which can produce at least one 
of Human Adenovirus Type 41 Tak (Ad41) short fiber protein, long fiber 
protein, or proteins RL-1 to RL-6 of the Ad41 E3 region. (There are 
presently several isolates known of human adenovirus type 41, but 
the most common isolate of this adenovirus is human adenovirus type 
25 41 Tak, represented in the present invention. This isolate is the 
standard Ad41 strain and it is listed in the American Type Culture 
collection under catalog number ATCC #VR-930.) 
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1 The Ad41 short and long fiber protein gene and Ad41 £3 

proteins are useful for assays for human enteric adenoviruses since 
they express only minor immunological cross*reactivity between 
adenoviruses belonging to different serotypes; they are unique 
5 adenovirus proteins , (i.e. , Ad41 long fiber protein and possibly the 
short fiber as well are responsible for attachment of the virus to 
specific cellular receptors in the cell membrane during infection) 
and they express selective type-specific antigenicity. The genes of 
the present invention are ideal candidates for specif ic^ selective 

10 monoclonal antibodies based on an enzyme immunoassay CIA) kitr 
a DNA probe assay system and a vaeeine derived from the gene producrs . 
The present invention will not only enhance the understanding of th^ 
mechanism by which human enteric adenoviruses cause disease in 
humans r but will also assist in developing molecular probes fof 

15 diagnosis of such infections. 

The present invention relates to an isolated nucleic acid 
encoding a protein selected f rom hisnan adenovirus type 41 Tak long 
fiber protein, short fiber protein, £3 RL-1 protein, £3 RL-2 protein, 
£3 RL-3 protein, £3 RL-4 protein, £3 RL-5 protein or £3 RL-6 protein. 

20 The present invention also relates to a replicable expression vector 
comprising the nucleic acid encoding a protein selected from human 
adenovirus type 41 Tak long fiber protein, short fiber protein, £3 
RL-1 protein, £3 RL-2 protein, £3 RL-3 protein, E3 RL-4 protein, £3 
RL-5 protein or £3 RL-6 protein opersLbly linked to a nucleotide 

25 sequence capable of effecting an expression of a polypeptide encoded 
by any one of said nucleic acids. 
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1 The present invention further relates to a recombinant 

protein of human enteric adenovirus Type 41 wherein said protein is 
long fiber protein, short fiber protein, RL-1, RL-2, RL-3, RL-4, M*- 
5 or RL-6. 

5 In addition, the present invention relates to a 

polypeptide comprising an antigenic fragment of human adenovirus Type 
41 long fiber protein, short fiber protein, RL-1 protein, RL-2 
protein, RL-3 protein, RL-4 protein, RL-5 protein or RL-6 protein. 
Also the present invention relates to antibodies against a long fiber 
10 protein of hiaman adenovirus Type 41, a short fiber protein, RL-1 
protein, RL-2 protein, RL-3 protein, RL-4 protein, RL-5 protein or 
RL-6 protein. ^ 
Further, the present invention relates to a vaccine for 
irranunization against a human adenovirus comprising the administra-f 
15 tion of a mixture of inactivated Ad4l and at least one of recombinant 
Ad41 long fiber protein, recombinant Ad41 short fiber protein and 
recombinant Ad41 E3 proteins RL-1 to RL-6 or active fragments thereof 
in association with a conventional vaccine cairrier. 

Another aspect of the invention relates to a method of 
20 detecting or diagnosing human adenovirus comprising contacting 
serum, tissue, or tissue extracts of an individual to be tested with 
an antibody against Ad41 long fiber protein, short fiber protein, 
RL-1 protein, RL-2 protein, RL-3 protein, RL-4 protein, RL-5 protein 
or RL-6 protein or an active fragment thereof, for a time and under 
25 condit-ions necessary to form an antibody-antigen complex, and 
detecting any resultant antibody-antigen complex. 
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1 Yet another aspect of the invention is a method for 

detecting human adenovirus Type 41, human adenovirus Ad40 or any 
adenovirus antigenically or structurally sind.lar to human AD41 in 
infected cells in a sample comprising lysing said cells, fixing the 
5 DNA of the infected cells and detecting the DNA containing said long 
fiber protein gene^ short fiber protein gene or E3 gene by a specific 
probe nucleic acid wherein said probe nucleic acid is DNA, cDNA, 
recombinant DNA or RNA. 

Still another aspect of this invention is a compartmen- 
10 talized kit for detection of human adenovirus type 41 comprising at 
leas- one first container adapted to contain an antibody having 
specificity for Ad41 long fiber protein, short fiber procein or E\ 
proteins RL-1 to RL-6 and at least one second container adapted to 
contain a reporter molecule capable of detecting the antibody of said 
15 first container - 

Fig. 1 is a representation of the DNA sequence of the hxunan 
enteric adenovirus Type 41 Tak long fiber protein gene^ and the 
corresponding amino acid sequence of Ad41 long fiber protein. 

Fig. 2 is a representation of the DNA sequence of the human 
20 enteric adenovirus Type 41 Tak short fiber protein gene. 

Fig. 3 is a representation of the amino acid sequence of 
Ad41 short fiber protein. 

25 
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1 Fig. 4 is a representation of the DNA sequence of the human 

enteric adenovirus Type 41 Tak E3 gene. 

Fig. 5 is a representation of the amino acid sequence of 

Ad4 1 RL- 1 protein . 
5 Fig. 6 is a representation of the amino acid sequence of 

Ad4 1 RL- 2 protein . 

Fig. 7 is a representation of the amino acid sequence of 
Ad41 RL-3 protein. 

Fig. S is a representation of the amino acid sequence of 
10 Ad41 RL-4 protein. 

Fig. 9 is a representation of the amino acid sequence Ad41 
protein. 

Fig. 10 is a representation of the amino, acid secpaence 
of Ad41 RL-6 protein. I 
15 Fig. 11 is a representation of a map of the protein coding 

regions in the £3 region and fiber (short and long) area of the human 
enteric adenovirus type 41 Tak. The E3 region is represented by 
proteins RL-1 to RL-6. The map position of the fragment shown is 74% 
to 92%. 

20 The present invention contemplates identification, 

isolation and utilization of struct\iral conponents of Type F 
Adenoviruses. In particular, the present invention relates to the 
human adenovirus Type 41 Tak (Ad41) long fiber protein gene^ short 
fiber protein gene, and the entire E3 gene, and 

25 
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1 diagnostic assays^ monoclonal and polyclonal antibodies^ DNA probes, 
and vaccines prepared relative thereto. This invention provides the 
advantage of a previously unavailable source of virus particles and 
parts thereof, and antigenic determinants and parts thereof, being 
5 highly desirable for its medical and experimental utility 

In accordance with the present invention, the Ad41 long 
fiber protein gene, the Ad41 short fiber protein gene and the Ad41 
£3 gene have been obtained by DNA sequencing of selected clones from 
an Ad41 library using standard techniques • 
10 With respect to the Ad41 fiber protein gene coding for 

a 60.6 led Ad41 fiber protein, henceforth this will be referred to 
in the Specification and Claims, as "Ad41 long fiber protein" and 
"^Ad41 long fiber protein gene". In particular, this Adfll long fiber 
protein gene found in the 1 .9 Kb Smal-EcoRI DNA fragment (map position 
15 86.4% to 92%) of the human enteric Ad41 strain Tak was cloned in 
pBluescript II and sequenced directly using custom oligonucleotide 
primers. The gene coding for the Ad41 long fiber protein was 
identified using the sequence of AdS fiber protein gene as a 
reference. The procedure is outlined in more detail in the Examples . 
20 In general, the fiber protein gene has three structural 

domains,, the tail/ the shaft and the knob, (i.e., NH, (N-terminusl 
- tail, shaft, knob - COOH {C-terminusl ) . Of these three domains, 
the **knob", which is responsible for the interaction of the virus 
with the cellular receptors displayed the lowest homology with 
25 other- human adenoviruses such as Ad2, Ad3, AdS, and Ad7 at the DNA 
or protein level. 
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1 A 650 bp Hind III/Eco RI DNA fragment coding for the *V<nob" 

domain is siibcloned on pUClS vector and used in standard Souchern 
hybridization with DNAs of representative serotypes of the Adenovirus 
subgroups A, C, D, and F. Only huitan enteric adenoviruses Ad 
5 40 and Ad 41 of Type F can be detected. 

The dsDNA sequence of the Ad41 long fiber protein gene 
and subsequent amino acid sequence of Ad41 long fiber protein is 
represented in Fig . 1 . Ad41 long fiber protein shows a high decree 
of homology with Ad40 fiber protein^ except for the shaft region. 
10 The Ad41 long fiber protein gene shaft contains 22 typical aniino acid 
repeats^ whereas Ad40 has only 21 such repeats. (This refers ::o the 
fact that all fiber protein genes sequenced to date have shown a 
characteristic 15-residue motif, which is repeated 6 X-o 12 tiir.es and 
detection of this motif has aided rapid recognition of the sequencf 
15 There is 97.7% homology between the amino acid sequence of Ad41 long 
fiber protein and Ad40 fiber protein in the knob region. 
Further analysis has shown that the long fiber protein gene as 
represented in Pig. 1, starting from the N-terminus (from the left 
in Fig. 1 or from the 5' end of the DNA) is composed of the domains 
20 discussed above and set forth in further detail below. 

(i) **Tail". It is 126 bases long (from base at position 
201 to 326) . On the protein level, it has 42 amino acid residues (frcn 
Met [Methionine] to Pro (Proline] ) . The '"tail" anchors the fiber in 
the penton base on the virion surface and show a high degree of 
25 homology between all adenoviruses. 
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1 (ii) "Shaft". It is 1033 bases long (from base at position 

327 to 1364 ) . On the protein level it has 346 amino acid residues 
(from Gly [Glycine] to Leu [Leucine] ) . The "^shaft" is a structural 
part of the fiber and is composed of repeating units (about 15 amino 
5 acids in each unit) showing high structural (but not sequence) 
homology among all adenoviruses. The number of these repeating units 
determines the length of fiber protein. In the case of Ad5, Ad2 and 
Ad41^ there are 22 such units; in the case of Ad3 and Ad7, 6 units; 
and in Ad40, 21 units. 

10 (iii) '^Knob". It is 525 bases long (from base at position 

1365 to 1889) . On the protein level, it has amino acid residues (from 
Trp [Tryptophan) to Gin (Glutamic acid]). The TAA sequence endipg 
the DNA sequence of the fiber gene (bases 1887-1889) ,is a part of 
the gene, but is not translated into an amino acid; it is a terminatibn 

15 (or nonsense) codon. The '*)cnob" region is responsible for the 
interaction of the virus with cellular receptors and determines the 
specificity of the virus. It differs substantially from adenovirus 
to adenovirusr depending on the types of cells infected by the virus. 

The sequences flan)cing the Ad41 long fiber protein gene 

20 found in Fig. 1 contain various regulatory signals. 

With respect to the previously undiscovered 41.4 kd Ad41 
protein gene and subsequent protein encoded therein, these are 
henceforth characterized in the Specification and Claims as **Ad41 
short fiber protein gene" and ^*Ad41 short fiber protein". 

25 
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1 It was surprisingly found when sequencing the DNA of the 

human enteric adenovirus type 41 Tak genome upstream of the Ad41 
long fiber protein gene, using standard techniques , that an 
open reading frame of 387 amino acids existed coding for the 
5 heretofore undisclosed Ad41 short fiber protein. The fdLrst 42 
amino acids of the Ad41 short fiber protein show a high degree of 
homology both to Ad41 (74%) and Ad2 (61%) 60.6 kd long fiber protein 
tail domains. Furthermore, amino acids 43 to 233 of the short fiber 
protein form a typical shaft domain of twelve 15-residue repetitive 
10 motifs which is in contrast to 22 such repeats found for Ad2, Ad5, 
and the long fiber protein of Ad41 or 6 repeats found for Ad3 and 
Ad7. The knob domain (amino acids 234 to 387) is about 15% shorter 
than found for the aibove mentioned viruses . If this gene p.s expressed, 
Ad41 would resemble avian adenoviruses which were found to haye 
15 two fibers of different length protruding from their pentons. The 
sequence presented in Fig. 2 is from the EcoRV site at map position 
83*1% to the AccI site at map position 87.1%. This region was cloned 
and sequenced in a manner that described above. 

The structure of the short fiber shows the same structural 
20 elements as described for other fiber genes (but not the identical 
sequence), namely: 

(i) ""Tail''. It is 126 bases long (from base at position 
157 to 282) • On the protein level, it has 42 amino acid residues 
(from Met [Methionine] to Pro [Proline]). 
25 • - • (ii) ^Shaft". It is 573 bases long (from base at position 

283 to 855) • On the protein level it has 191 amino acid residues 
(from Gly [Glycine] to lie [ Isoleucine] ) . The short fiber of Ad41 
has 12 repeating units. 
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1 (iii) '*Knob". The short fiber "'knob" of Ad41 is 465 bases 

long (from base at position 856 to base at position 1320) . On the 
protein level, it has 154 amino acids (from Trp (Tryptophane] to 
Gin (Glutaminel ) . The TAA sequence ending the short fiber protein 
5 gene (bases 1318-1320) is a part of the gene, but is not translated 
into an amino acid; it is a termination codon. 

The knob region of the Ad41, short fiber protein is very 
different from the knob region of the long fiber protein as well as 
from knob regions of fiber proteins of other adenoviruses. 

10 This enteric adenovirus (Ad41) is understood to use two 

different receptors on the surface of a cell for binding and/or 
penetration. It is also understood that two different fibers with 
distinct knobs" permit the Ad41 virus to infect ^t least two 
different types of cells in the gastrointestinal tract. Therefor^ 

15 the present invention also relates to diagnostic immunoassays and 
effective vaccines which utilize the different Ad41 fiber proteins 
as discussed in further detail below. 

In addition, the present invention also conteitplates 
another critical sequence, the DNA sequence of the Ad41 E3 region 

20 as shown in Fig. 3. This will be referred to in the Specification 
and Claims as the Ad41 E3 gene. 

In addition, the amino acid sequences of six putative proteins encoded 
by this region are described herein, and referred to in the 
Specification and Claims as RL-1, RL-2, RL-3, RI--4, RL-5 and RL-6 
25 as set* forth in further detail below. 

The Ad41 E3 region DNA sequence has 3373 bases, including 
the flanking regions . The sequence disclosed herein is from the EcoRI 
restriction site at map position 74% to the Espl restriction site 
at map position 83.9%. 
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1 The Acl41 £3 region codes for some unique, previously 

unrevealed proteins. The Ad41 E3 region contains information 
sufficient to code for at least 6 proteins; in the following order 
(from the left, or 5* end): 

3 (1) The region from base 683 to base 1204 codes for a 19. 4 

kd protein, referred to herein as RL-1. This protein has 173 amino 
acid residues. It is unique for Ad41. 

(2) The region from base 1207 to 2037 codes for a 31.6 
kd protein, referred to herein as RL-2. This protein has 276 amino. 

10 acid residues. It is unique for Ad 41. 

(3) The region from base 1730 to 1909 codes for a 6.7 kd 
protein (in a different reading frame than the 31.6 kd protein), 
referred to herein as RL-3. This protein has 59 amino acid residues 
and is unique for Ad41. i 

15 (4) The region from base 2056 to base 2328 codes for a 

10 . 1 kd protein, referred to herein as RL-4 . This protein as 90 amino 
acid residues and shows 40% homology to an Ad 2 10.4 kd protein. 
It was postulated by Carlin, et al.. Call, 52:135-144 (1989) that 
the same protein in Ad2 induces internalization and degradation of 
20 the epidermal growth factor receptors ( EGF-R ) . 

(5) The region from base 2325 to base 2648 codes fcr a 
12.3 kd protein, referred to herein as RL-5 . This protein has 107 
amino acid residues and shows 35% homology to an Ad2 14.5 kd protein; 
the function of the Ad2 protein is unknown. 
25 (6) Finally, the region from base 2641 to base 3009 codes 

for a 14.0 kd protein, referred to herein as RL-6. This protein has 
122 amino acid residues and shows 50% homology to an Ad2 14,7 kd 
protein which was found by Gooding, et al.. Call. 51:341-346 (1938) 
to inhibit cytolysis by the tianor necrosis factor (TNF) . 
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1 The present invention contemplates the use of the 

Ad41 long and short fiber protein genes, and the Ad41 E3 region gene, 
via production of their gene products, to prepare antibodies. Such 
antibodies may be monoclonal or polyclonal. Additionally, it is 

5 within the scope of this invention to include second antibodies 
(monoclonal or polyclonal) directed to the first antibodies discussed 
above . 

Accordingly, the pesent invention relates to a method 
for stimulating an immune response to human adenovirus type 41 Tak 

10 which consists of administering an effective amount of at least one 
of Ad41 long fiber protein, Ad41 short fiber protein, and Ad41 E3 
proteins RL-1, RL-2, RL-3, RL-4, RL-5 and RL-6, under conditions 
as described below, sufficient to cause the production of polyclonal 
or monoclonal antibodies to at least one of said Ad41 proteini, 

15 wherein the dosage effective amount of said Ad41 proteins can be from 

about 0.001 mg to 100 mg. 

In order to produce such antibodies, Ad41 long fiber 
protein, Ad41 short fiber protein or Ad41 E3 proteins (RL-1 to RL- 
6) are first purified, and methods of antibody production are 
20 described below. Both polyclonal and monoclonal antibodies are 
obtainable by immunization with at least one of the above-identified 
proteins or their active components (which, in the case of the fiber 
proteins, can be the tail, shaft or knob) . The methods of obtaining 
both types of antibodies are well known in the art; e.g., extensive 
25 protocols for antibody production can be found in Harlow, et al., 
an^^v^f^rfi^<»• a T.ahnratorv Manual, Cold Spring Harbor, N.Y., 1988. 
Polyclonal antibodies are less preferred, but are relatively easily 
prepared by injection of 
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1 a suitable laboratary animal withr for exanple, 0.001 to 100 mg of 
the pxirif ied viral antigenic component, collecting serum from the 
animal, and isolating specific sera by any of the known immunoadsorbenr 
techniques. Although antibodies produced by this method are 
5 utilizable in virtually any type of imunoassay, they are generally 
less favored because of the potential heterogeneity of the product. 

In another embodiment of the present invention, mono- 
clonal antibodies are contemplated for detection and diagnosis of 
Ad41 and related adenoviruses. 

10 The production of monoclonal antibodies relative to the 

present invention is particularly preferred because of the ability 
to produce monoclonal antibodies in large quantities and the 
homogeneity of the final product. The preparation of hybridoma cell 
lines for monoclonal antibody production derived by fusing an 

15 immortal cell line and lyn:?>hocytes sensitized against the immunogenic 
preparation can be done by techniques which are well known to those 
who are skilled in the art. (See, e.g., Kohler, G. and Milstein, C,, 

Nature 2 56 : 495-497, 1975; European Jonrnal Ttrnnunftln^. £:511- 

519, 1976; the teachings of which are hereiri incoirporated by 
20 reference) . 

Unlike preparation of polyclonal sera, the choice of 
animal is dependent on the availability of appropriate immortal lines 
capable of fusing with lymphocytes thereof. Mouse and rat have been 
the animals of choice in hybridoma technology and are preferably used. 
25 Humane can also be utilized as sources for sensitized lynphocytes 
if appropriate immortalized human (or nonhuman) cell lines are 
available. For the purpose of the present invention the animal of 
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1 choice may be injected with, for exannple, a preferred range from about 
1 mg to about 20 mg of the purified virus or antigenic component 
thereof. (A range of 0.001 mg to 100 mg of purified viral component 
is also contenplated. ) Usually the injecting material is emulsified 
5 in Freund's complete adjuvant. Boosting injections may also be 
required. The detection of antibody can be carried out by testing 
the antisera- with appropriately labeled antigen. Lymphocytes can be 
obtained by removing the spleen or lymphnodes of sensitized animals 
in a sterile fashion and carrying out fusion. Alternately, 

10 lymphocytes can be stimulated or immunized in vitro , as described, 
for exan^le, in C.Reading, J . Immun ol - Meth . 53: 261-291, 1982. 

A number of cell lines suitable for cell fusion , have 
been developed, and the choice of any particular cell line for 
hybridization protocols in the production of monoclonal antibodies 

15 is directed by any one of a nundber of criteria such as speed, 
uniformity of growth characteristics, deficiency of its metabolism 
for a component of the growth medixizn, and potential for a good fusion 
frequency . 

Intraspecies hybrids, paurticular ly between like strains , 
20 work better than interspecies fusions. Several cell lines are 
available, including mutants selected for the loss of ability to 
secrete myeloma immunogluJdulin . Included among these are the 
following mouse myeloma lines: MPC11-X45-6TG, P3-NSl-l-Ag4-l, P3- 
X63-Ag8, or mutants thereof such as X63-Ag8.653r SP2-0-Agl4 (all 
25 BALB/C-derived) , Y3-'Agl.2.3 (rat), and U266 (hxaman) . 
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1 Cell fusion can induced either by virus, such as Epstiein- 

Barr on Sendai virus, or polyethylene glycol. Polyethylene glycol 
(PEG) is the most efficacious agent for the fusion of mammalian 
somatic cells. PEG itself may be toxic for cells, and various 
5 concentrations should be tested for effects on viability before 
attempting fusion- The molecular weight range PEG may be varied from 
IrOOO to about 70% w/w in saline or serum-free medium. Exposure to 
PEG at 37® C for about 30 seconds is preferred in the present case, 
utilizing murine cells. Extremes of temperature (i.e. about 45^ C) 

10 are avoided, and preincubation of each component of the fusion sysvi^em 
at 37^ C prior to fusion gives optimum results . The ratio between 
lymphocytes and malignant cells range of from about 1:1 to about 1:10 
gives good results. , 

The successfully fused cells can be separated from the^* 

15 myeloma line by any technique known by the art. The most common and 
preferred method is to choose a malignant line which is Hypoxanthine 
Guanine Phosphoribosyl Transferase (HCaPRT) deficient, which will not 
grow in an aminopterin containing medium used to allow only growth 
of hybrids and which is generally composed of hypoxanthine IxlO'^M, 

20 aiminopterin IxlO^M, and thymidine 3xlO"*M, commonly known as the HAT 
medium. The fusion mixture can be grown in the HAT-contaihing culture 
medium immediately after the fusion 24 hours later. The feeding 
schedules usually entail maintenance in HAT medium for two weeks and 
then feeding with either regular culture meditam or hypoxanthine, 

25 thymidine containing meditim. 
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1 The growing colonies described above are tested for the 

presence of monoclonal antibodies that recognize the antigenic 
preparation r wherein said antigenic preparation which includes at 
least one of the above-identified Ad41 proteins or a derivative 
5 thereof. Hybridonia antibodies are identified by using an assay where 
the antigen is bound to a solid support and allowed to react to 
hybridoina supernatants containing putative antibodies . The presence 
of antibodies is shown by sandwich" techniques using a variety of 
indicators, as discussed in further detail below. Most of the common 
10 methods are sufficiently sensitive for use in the range of antibody 
concentrations secreted during hybrid growth. 

Cloning of antibody-secreting hybrids can be carried out^ 
after 21-23 days of cell growth in selected medium. Cloning can be 
performed by cell limiting dilution in fluid phase or by directly; 
15 selecting single cells growing in semi-solid agarose. For limiting 
dilution, cell suspensions are diluted serially to yield a 
statistical probability of having only one cell per well. For the 
agarose technique, hybrids are seeded in a semisolid upper layer, 
over a lower layer containing feeder cells. The cglonies from the 
20 upper layer may be picked up and eventually transferred to wells. 
Antibody- secreting hybrids can be grown in various tissue 
culture flasks, yielding supernatants with variable concentrations 
of antibodies. In order to obtain higher concentrations, hybrids may 
be transferred into animals to obtain inflammatory ascites. Antibody- 
25 containing ascites can be harvested 8-12 days after intraperitoneal 
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1 injection. The ascites contain a higher concentration of antibodies 
but include both monoclonals and immunoglobulins from the inflam- 
matory ascites. Antibody purification may then be achieved by, for 
exan^le/ affinity chromatography. The present invention further 
5 contemplates the use of the above^described antibodies in a detection 
assay (immunoassay) for human enteric adenoviruses (Group F) t par- 
ticularly Ad41 and Ad40. 

A wide range of iirmunoassay techniques are availsUDle as 
can be seen by reference to U.S. Patent Nos. 4,016,043, 4,424,279, 
10 4,018,653 and by Harlow, et al., snpra . This, of course, includes 
both single-site and two-site, or ^sandwich", assays of the non- 
conpetitive types, as well as in traditional competitive binding 
assays . Sandwich assays are among the most useful and f ommonly used 
assays and are favored for use in the present invention. A number 
13 of variations of the sandwich assay technique exist, and all are 
intended to be enccxipassed by the present invention. 
In a typical forward assay, an unlabeled antibody is inntobilized in 
a solid substrate and the saiqple to be tested brought into contact 
with the bound molecule. After a suitable period of incubation, for 
20 a period of time sufficient to allow formation of an antibody-antigen 
binary con^lex, a second antibody, labeled with a reporter molecule 
capable of producing a detectable signal is then added and incubated^ 
allowing time sufficient for the formation of a ternary complex of 
antibody-labeled antibody. Any unreacted material is washed away, 
25 ^nd.. the presence of the antigen is determined by observation of the 
visible signal produced by the reporter molecule. The results may 
either be 
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1 qualitative, by simple observation of the visible signal, or may be 
quantitated by comparing with a control sample containing known 
amounts c£ hapten. 

Variations on the forward assay include a simultaneous 
5 assay, in which both sample and labeled antibody are added 
simultaneously to the bound antibody, or a reverse assay in which 
the labeled ^antibody and sample to be tested are first combined, 
incubated and then added to the unlabeled surface bound antibody . 
These techniques are well known to those skilled in the art, and the 

10 possibility of minor variations will be readily apparent to those 
skilled in the art- 

As used herein, '^sandwich assay" is intended to encompass 
all variations on the basic two-site technique. For esfample, these 
antibodies may be used to detect Ad41 by its long and/or short fibeir 

15 proteins or any one of E3 proteins RL-1 to RL-6 or other antigenically 
related adenoviruses (i.e., Ad40) by use of specific antigenic 
determinants, or parts thereof {i.e., Ad41 fiber proteins, or the 
tails, shafts or knobs of said proteins) as immobilized iimunoadsorbants. 
Serum is obtained from subjects to be tested and said serum contacted 

20 to the immobilized viral immunoadsorbants . If said serum contains 
antibodies to said iromunoadsorbants, an antibody-adsorbant conjugate 
will result. After removing excess serum and non-bound antibodies^ 
a second antibody specific to a first antibody, said first antibody 
being capable of forming a conjugate with said immunoadsorbant, is 

23 added t-hus resulting in a double antibody-adsorbant conjugate. This 
double antibody-adsorbant conjugate will only result if the test 
serum contains antibodies to the immunoadsorbant . Consequently, 
standard detection techniques can be used to identify the conjugate. 
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1 In another iitmunoassay, the con^etitive binding assay ^ 

a limiting amount of antibody specific for the molecule of interest 
(either an antigen or hapten) is combined with specific volumes of 
solutions containing an unknown amount of the molecule to be detected 
5 and a solution containing a detectably labeled known amount of the 
molecule to be detected or an analog thereof. Labeled and unlabeled 
molecules then corrpete for the available binding sites on the 
antibody. Phase separation of the free and antibody-bound molecules 
allows measurement of the amount of label present in each phase, thus 
10 indicating the amount of antigen or hapten in the sample being tested. 
A number of variations in this general competitive binding assay 
currently exist. 

In any of the known immunoassays, for practical purposes, 
one of the antibodies to the antigen (Ad41 long fiber protein, Ad4j; 
15 short fiber protein or any one of Ad41 E3 proteins RL-1 to RL-6 or 
fragments thereof) will be typically bound to a solid phase and a 
second molecule f either the second antibody in a sandwich assay ^ or 
in a competitive assay, the known amount of antigen, will bear a 
detectable label or reporter molecule in order to allow visual 
20 detection of an antibody-antigen reaction. When two antibodies are 
employed, as in the sandwich assay, it is only necessary that one 
of the antibodies be specific for, e.g., Ad41 short or long fiber 
protein or its antigenic fragments (the tail, the shaft or the knob) . 
The following description will relate to a discussion of a typical 
23 forward sandwich assay; however, the general techniques are to be 
understood as being applicable to any of the contemplated 
inmunoassays . 
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1 In the typical forward sandwich assay, a first antibody 

having specificity for, e.g., Ad41 short or long fiber protein or 
its antigenic fragments is either covalently or passively bound to 
a solid surface. The solid surface is typically glass or a polymer, 
5 the most commonly used polymers being cellulose, polyacrylamide, 
nylon, polystyrene, polivinyl chloride or polypropylene. The solid 
supports may- be in the form of tubes, beads, discs or microplates, 
or any other surface suitable for conducting an immunoassay. The 
binding processes are well-known in the art and generally consist 

10 of cross-linking, covalently binding, or physically adsorbing the 
molecule to the insoluble carrier. Following binding, the polymer- 
antibody complex is washed in preparation for the test sample. An^ 
aliquot of the sample to be tested is then added to th^ solid phase 
complex and incubated at 25** C for a period of time sufficient to allowi 

15 binding of any subunit present in the antibody. The incubation period 
will vary, but will generally be in the range of about 2-40 minutes. 
Following the incubation period, the antibody subunit solid phase 
is washed and dried and incubated with a second antibody specific 
for a portion of the hapten. The second antibody is linked to a 

20 reporter molecule which is used to indicate the binding of the second 
antibody to the hapten. 

By "reporter molecule", as used in the present specifi- 
cation and claims, is meant a molecule which, by its chemical nature, 
provides an analytically identifiable signal which allows the 

25 detection of antigen-bound antibody. Detection may be either 
. qualitative or quantitative. The most commonly used reporter 
molecules in this type of assay are either enzymes, fluorophores 
or radibnucleotide containing molecules. 
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1 In the case of an enzymlc immunoassay (EIA) , an enzyme is 

conjugated to the second antibody^ generally by means of glutaraldehyde 
or periodate. As will be readily recognized, however, a wide variety of 
different conjugation techniques exist, which are readily available to 
5 the skilled artisan. Coninonly used enzymes include horseradish 
peroxidase, glucose oxidase, beta-galactosidase and alkaline phosphates, 
among others. The substrates to be used with the specific enzymes are 
generally chosen for the production, upon hydrolysis by the corresponding 
enzyme, of a detectable color change. For example, p-nitrophenyl 
10 phosphate is suitable for use with alkaline phosphatase conjugates; and 
for peroxidase conjugates, 1, 2-phenylenediamine, 5-aminosalicyclic 
acid, or tolidine are commonly used. It is also possible to employ 
fluorogenic substrates, which yield a fluorescent product rather tlian 
the chroroogenic substrates noted above. ^ 
15 In all cases, the enzyme-labeled antibody is added to the 

first antibody hapten complex, allowed to bind, and then excess 
reagent is washed away. A solution containing the appropriate 
substrate is then added to the ternary con5>lex of antibody-antigen- 
antibody. The substrate will react with the enzyme linked to the 
20 second antibody, giving a cpialitative visual signal, which may be 
further quantitated, usually spectrophotometrically, to give an 
indication of the amount of hapten which was present in the sample. 

Alternately, fluorescent compounds, such as fluorescein 
and rhodamine, may be chemically coupled to antibodies without 
25 altering their binding capacity. When activated by illumination with 
light of a particular 



30 



35 



wo 91/08310 



-25- 



PCr/US90/06887 



1 wavelength, the f luorochrome-labeled antibody absorbs the light 
energy, inducing a state of excitability in the molecule, followed 
by emission of the light at a characteristic color visually 
detectable with a light microscope. As in the EIA the fluorescent 
5 labeled antibody is allowed to bind to the first antibody-hapten 
complex. After washing off the unbound reagent, the remaining ternary 
complex is then exposed to the light of the appropriate wavelength, 
the fluorescence observed inicates the presence of the hapten of 
interest. Immunofluorescence and EIA techniques are both very well 
10 established in the art and are particularly preferred for the present 
method. However, ether report molecules, such as radioisotope, 
chemiluminescent of bioluminescent molecules, may also be employed.^ 
It will be readily apparent to the skilled technician how to vary 
the procedure to suit the required purpose . It will also be apparent 
15 that the foregoing can be used to detect directly or indirectly (i.e., 
via antibodies) Type F adenoviruses. 

In a preferred embodiment, the present invention also 
contemplates the use of the Ad41 S3 proteins RL-1 to RL-6 and Ad41 
short fiber protein knob and Ad41 long fiber protein knob in enzyme 
20 immunoassays for selective detection of human enteric adenoviruses 
and in particular Ad41 and Ad40 in the stool of patients with 
gastroenteritis. EIA can give a clear, rapid result in about 2 hours 
and can therefore be more convenient and efficient and less expensive 
than a DNA probe test. 

25 
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1 The present invention further conuemplates an ELISA 

(enzyme-linked ixnnunoabsorbent assay) test for the presence of 
antibodies to Ad41 long or short fiber protein or Ad41 E3 proteins 
RL-1 to RL-6 in serum or other specimens, such as saliva or the 
5 duodenal fluid from patients with gastroenteritis. The Ad41 long or 
short fiber protein **lcnob" of the present invention can be used/ for 
example r to -coat microtiter plates. 

The present invention also contemplates the use of 
recombinant DNA molecules which contain at least one of the following 
10 genes: Ad41 long fiber protein gene, Ad41 short fiber prot.ein gene^ 
Ad41 E3 region gene encoding for proteins RL-1, RL-2, RL-3, RL-4, 
RL-5 or RL-6. The present invention contemplates using these 
recombinant DNA molecules in the development of diagnostic assays 
for Ad41. In another embodiment, the present invention contemplate? 
15 the use of recombinant DNA molecules or derivatives thereof as 
described above, to . generate antibodies usefiil in diagnostic and 
therapeutic techniques. 

Another aspect of the present invention is the employment 
of the genetic information contained in the DNA of the Ad41 long fiber 
20 protein gene, the Ad41 short fiber protein gene and the Ad41 23 gene. 
As defined herein, DNA is referred to as the genetic component of 
the virus (i.e., double-stranded DNA) . Said DNA can be inserted in 
recombinant expression molecules such that, for example, the Ad4i 
long fiber protein gene encoded thereon is transcribed and the produce 
25 can then be obtain€Ki. Such products can then be used as antigenic 
components to generate, for example, antibodies. The present 
invention conten^lates the 

30 



35 



wo 91/08310 



27 



PCT/US90/06887 



1 trans formatian of a host cell or organism with dsDNA of Fig. 1 (Ad41 
long fiber protein gene) and/or Fig. 2 (Ad41 short fiber protein gene) 
and/or Fig. 4 (Ad41 E3 gene) which is capable of producing Ad41 (long 
or short) fiber protein or Ad41 E3 (RL-1 to RL-6) proteins wherein 
5 the host cell or organism is a bacterium (e.g., E. COli) / yeast, insect 
cell or a mammalian cell. 

The present invention also relates to DNA described 
above which can be used to generate probe nucleic acids for 
hybridization to homologous Ad41 or Ad40 DNA sequences, utilizing 

10 at least one of the following Ad41 genes: Ad41 long fiber protein 
gene, .^d41 short fiber protein gene or Ad41 E3 gene encoding the 
RL-1 to RL-6 Ad41 proteins. 

Another aspect of this invention relates to a recombinant 
nucleic acid or an isolated nucleic acid molecule, said molecules 

15 defined herein to be dsDNA or recombinant DNA encoding Ad41 short 
fiber protein, Ad41 long fiber protein^ or E3 proteins RL-1 to R- 
6, or parts thereof. In one Mtbodiment the recombinant nucleic acid 
molecule is coit5>lementary DNA (cDNA) . It is considered within the 
scope of the present invention to include the cDNA molecule encoding 

20 the above-identified Ad41 proteins, or to regions or parts thereof 
including any base deletion, insertion or substitution or any other 
alteration with respect to nucleotide sequence or chemical 
composition (e.g. methylation and glycosylation) . Additionally, the 
present invention is directed to restriction fragments and synthetic 

25 fragments from a nucleic acid encoding the above-identified Ad41 
proteins. Moreover, another embodiment of the this invention is 
directed to the genomic Ad41 long fiber protein gene, Ad41 
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1 short fiber protein gene or E3 gene, which may include recombinant 
clones like cosmids encoding the entire gene or subclones encoding 
any region of the above-identified genes . Recombinant DNA encoding 
such subregions of the gene are useful as hybridization probes to 
5 detect the presence of the above-identified genes. 

Methods considered useful in obtaining recombinant Ad41 
cDNA are contained in Maniatis, et al.. Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory, New York; (2d Ed. 
1989) , for example, or any of the myriads of laboratory manuals on 
10 recombinant DNA technology which are widely available. Maniatis, et 
al. further discloses how to obtain deletions and insertions by site- 
directed mutagenesis, and subsequent selection of mutants for 

activity . * 

In a preferred embodiment, the present invention provides 

15 a dsDNA or recontoinant DNA or cDNA having a nucleotide sequence 
encoding the Ad41 long fiber protein gene, as shown in Fig. 1. This 
sequence encodes the 60 .6 kd Ad41 long fiber protein having the amino 
acid sequence shown in Fig. 1. 

The present invention fiirther provides a dsDNA or 

20 recombinant DNA or cDNA having a nucleotide sequence encoding the 
Ad41 short fiber as shown in Fig. 2, wherein this sequence encodes 
a 41.4 kd Ad41 short fiber protein having an amino acid sequence as 

shown in Fig. 3. 

The present invention additionally provides a dsDNA or 
25 recombinant DNA or cDNA having a nucleotide sequence which encodes 
for the £3 region of Ad41 as shown in Fig. 4 wherein this region encodes 
six E3 proteins, RL-1 to RL-6. The E3 DNA sequence, from base 683 
to base 1204, encodes RL-1, a 
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1 19.4 kd protein having an amino acid sequence as shewn in Fig. 5. 
The E3 DNA sequence from base 1207 to base 2037 encodes RL-2, a 31.6 
kd protein having an amino acid sequence as shown in Fig. 6. The E3 
DNA sequence from base 1730 to base 1909 encodes RL-3, a 6.7 kd protein 
5 having an amino acid sequence as shown in Fig. 7. The E3 DNA sequence 
from base 2056 to base 2328 encodes a 10.1 kd protein having 

an amino acid sequence as shown in Fig. 8 The £3 DNA sequence from 
base 2325 to base 2648 encodes RL-5r a 12.3 kd protein having an amino 
acid sequence as shown in Fig. 9. The S3 DNA sequence from base 2641 

10 to base 3009 encodes RL-6/ a 14.0 kd protein having an amino acid 
sequence as shown in Fig. 10. 

The present invention further contemplates the prepa- 
ration and use of a vaccine composition for the treatment of human 
adenovirus type 41 and related adenoviirusesr including Ad40 . The 

15 preparation of said vaccine is accomplished by utilization of at least 
one of the following adenovirus type 41 proteins: Ad41 short fiber 
protein, Ad41 long fiber protein, and E3 proteins RL-1 through RL- 
6 . This is done by genetic engineering of at least one of the above- 
identified proteins and expressing at least one of these proteins 

20 in suitable vector/host cell systems such as bacteria, yeast or any 
other suitable vector /host system. In a further preferred embodiment, 
the vaccine of the present invention contemplates the use of cloned 
Ad41 long fiber protein ^^knob" or short fiber protein "*knob" as an 
immunizing agent. 

25 ' ' Previously used vaccines have generally con^rised (I) an 

attenuated live virus type of vaccine in which the virus has been 
rendered avirulent but not killed by some form 
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1 of genetic attenuation; or (II) specific viral components isolated 
and purified from the virus and inactivated by formalin or some other 
chemical or physical treatment. The present invention contemplates 
conventional Type II vaccines, wherein the specific viral components 
5 isolated and purified from the virus and inactivated by formalin or 
other treatments are contemplated to be at least one of Ad41 shorz 
fiber protein, AD41 long fiber protein, E3 Rl-1, RL-2, RL-3, 
RL-5 or RL-6 protein. In addition, with respect to Ad41 long and 
short fiber protein ^^viral component" also contemplates at least one 
10 of the tail, shaft or knob of these proteins. The present invention 
also contemplates the preparation of recombinant Ad41 proteins for 
use in a vaccine against Ad41 and Ad40* 

In another embodiment, the present invention is directed 
to a Type II vaccine which is a combination of inactivated Ad41 and i 
15 at least one of recombinant long and short Ad41 protein fibers and 
Ad41 E3 proteins RL-1 to RL-6. 

By vaccine is meant an agent used to stimulate the immune 
system of a living organism so that protection against future harm 
is provided. Administration of a vaccine contemplated by the present 
20 invention to the patient (or animal) may be by any known or standard 
techniques. These include oral ingestion, intestinal intubation, or 
broncho-nasal spraying. Other methods of administration, such as 
intravenous injection, that allow the carrier microbe to reach the 
human or animal's bloodstream may be acceptable when the carrier 
25 microbe- is unable to reproduce. 
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1 Recombinant DMA techniques for the preparation of 

recombinant Acl41 proteins for use in the preparation of vaccines are 
sufficiently well known and widespread so as to be considered routine . 
In very general and broad terms, a method for use herein cosists of 
5 transferring the genetic material, or more usually part of the genetic 
material, of one organism into a second organism so that the 
transferred genetic material becomes a permanent part of (recombines 
with) the genetic material of the oganisms to which it is transferred. 
This usually consists of first obtaining a small piece of DNA from 

10 the parent organism either from a plasmid or a parent chromosome. 
A plasmid (also called an extrachromosomal element) is a hereditary 
unit that is physically separated from the chromosome of the cel]^. 
The DNA may be of any size and is often obtained by t^he action of 
a restriction endonuclease enzyme which acts to split DNA moleculds 

13 at specific base-pair sites. In the present invention an Ad41 long 
fiber protein gene can be obtained which is a 1.9 kb SmaI-£coRI DNA 
fragment or an Ad41 short fiber protein gene which is an £coRV*AccI 
DNA fragment or an Ad41 £3 sequence which is an £coRI-£5pI DNA 
fragment. The DNA pieces of the Ad41 protein gene may be transferred 

20 into a host cell by various means such as transformation (uptake of 
naked DNA from the external environment, which can be artificially 
induced by the presence of various chemical agents, such as calcium 
ions) . Other methods such as transduction are also suitable, wherein 
the DNA is packaged within a phage such as the co-called cosmid vector . 

25 Once 'th6 parent DNA is in the carrier cell, it may continue to exist 
as a separate piece (generally true of complete transmitted plasmids) 
or it may insert into the host cell chromosome and be reproduced with 
the chromosome during cell division. 
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1 Transferring genetic materials is relatively straight- 

forward. Any method capable of producing recombinant organisms 
comprising genes from pathogenic organisms that are expressed in 
avirulent microbes will suffice. The techniques of DNA isolation^ 
5 gene cloning, and related techniques are disclosed in great detail 
in, for example, R^f^ombinanti DNA. Methods of Enzymology^ Volume 68, 
Ray Wu, ed./ Academic Press (1979), and Maniatiis, T,, et al,, 
^nlAf-nlar Cloning . Cold Spring Harbor Laboratories (1982) , which are 
herein incorprated by references and are applicable to the Ad41 
10 protein gene of the present invention. 

Vaccines of the present invention may be administered 
either as a liquid or in enteric-coated capsules .Such preparations 
are resistant to acid and enzymes in the stomach of tlje inoculated 
animal while dissolving in the intestines . Various enteric-coating^ 
15 are known in the art, for example, as disclosed in U.S. Patent 
Nos.3,241,520 and 3,253,944 and are commercially available. A method 
suitable for preparation of enteric-coated capsules is described in 
U.S. Patent Ito. 4,152,415, which is herein incorporated by reference, 
and can be easily modified to provide capsules containing the carrier 
20 microbes of the present invention. 

Vaccines of the present invention may be administered 
orally in enteric-coated capsules as described above or may be 
administered parenterally (e.g. , by intraimiscular, subcutaneous, or 
in-iravenous injection) . The amount required will vary with the 
25 antigenicity of the gene product and need only be an amount sufficient 
to induce an immune response typical of existing vaccines. Routine 
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1 experimentation will easily establish the required amount. Typical 
initial dosages of vaccine could be about 0.001-100 mg antigen/kg 
body weight, with increasing amounts or multiple dosages used as 
needed to provide the desired level of protection. 
5 The pharmaceutical carrier in which the vaccine is 

suspended or dissolved may be any solvent or solid that is non-toxic 
to the inoculated animal and compatible with the carrier organism 
or antigenic gene product. Suitable pharmaceutical carriers include 
liquid carriers, such as normal saline and other non-toxic salts at 

10 or near physiological concentrations, and solid carriers, such as 
talc or sucrose. Adjuvants, such as Freund's adjuvant, complete or 
incomplete may be added to enhance the antigenicity via the bronchial^ 
tubes, the vaccine is suitably present in the form of an aerosol. 
Booster iitmunizations may be repeated numerous times with beneficial/ 

15 results . 

in a preferred embodiment, the present invention 
contemplates a vaccine specific to Ad41 long fiber protein or at least 
one of its active fragments, e.g., the tail, the shaft or the knob 
of the long fiber protein, a vaccine specific to Ad41 short fiber 

20 protein or at least one of its active fragments, or a vaccine specific 
to at least one of the proteins of the Ad41 E3 region, RL-1 to RL-6. 

A number of viral polypeptide preparations derived from 
viral coats or envelopes have been suggested as possible active 
components for vaccine compositions. For example, U.S. Patent No. 

25 4,470,967 describes vaccine preparations which are made by complexing 
viral polypeptide with a lectin, the latter element acting as 
adjuvant. A number of 
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1 references^ e.g.r 4,344^935 or 4,356,169 or Morein, et al., J- Gen. 
Virol . , 64 ! 1557-1569 r 1983, utilize a method of preparation of 
parainfluenza glycoprotein compositions in which the viral glyco- 
protein KN and F are solubilized with a detergent, to extract them 
5 from the viral envelope, followed by some method of phase separation 
in order to remove the detergent and lipids. The latter procedures 
produce a glycoprotein siabunit which is not only detergent free, but 
also lipid free. The latter type of highly purified glycoprotein is 
considered the preferred type of active agent for potential use of 
10 commercial vaccine. 

In another aspect, the present invention relates to a 
method of treating infectious diseases caused by Ad41 and other 
related adenoviruses such as Ad40. 

The subject invention also encompasses antibodies, 
15 either monoclonal or polyclonal, which are useful in the therapeutic 
control of infection by adenoviruses and in particuliar, Ad41 or Ad40 . 
Said antibodies can be prepared as described above and by injecting 
mammalian species, e.g., human, horse, rabbit, sheep, mice, etc. with 
inactivated virus or derivatives thereof (i.e., the tail, shaft or 
20 knob) and then purifying said antibodies employing the detection 
systems contemplated and described herein. 

In another embodiment, the present invention relates to 
the development of specific human or other eukaryotic (e.g., yeast, 
baculovirus, or Chinese hamster cells) polyclonal or monoclonal 
25 antibodies, as well as human-mouse chimeric polyclonal or monoclonal 
antibodies for administration in passive immunization against human 
adenoviruses, and in particular, Ad41 and Ad40. Immunization 
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1 refers to the process of inducing a continuing high antibody level in 
an organism i.e. , in humans, which is directed against an antigen to which 
the organism has been previously exposed. 

Passive immunization, as defined herein, refers to 
5 resistance (e.g., temporary or sustained protection against 
infection) based on giving preformed antibodies to a patient from 
an in vivo ., or in vitro source. The main advantage of passive 
immunization is the prompt availability of large amounts of 
antibodies against human adenoviruses as described in the aibove 
10 embodiment of the present invention. 

A chimeric antibody, as defined herein, is an antibody 
molecule made by recombinant DNA technology involving irrenunoglobulin 
genes of two different species. The human-mouse chimeric antibody 
is produced by combining the Fab portion of the mouse imraunoglobuldn 
15 gene and the Fc portion of the h\aman immunoglobulin gene by recombinant 
DNA technique. The prodution of human-mouse chimeric antibodies is 
advantageous since large amounts of antibodies can be produced by this 
system and human-mouse chimeric antibodies can be recognized by cells 
of the human immune system whereas non-chimeric antibodies would not be 
20 recognized as easily by cells (e.g., phagocytic) of the htanan immune 
svstem. The chimeric antibodies can be produced in large amounts in the 
mouse system and can recognize human adenoviruses as contemplated in the 
present invention. Human-mouse immunoglobulins have also been found to 
make large amounts of antibodies in yeast and this system is also 
25 contemplated herein. The following references discuss the methodologies 
for producing such antibodies and are incorporated herein by reference: 
Morrison, al. . P.N.A.S. . M:6851 (1984); Horowitz, et al. , P . N , ft , S 
15:8678" (1988); and Tao, et aT...T Tmmunol,, 113:2595 (1989). 
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1 The present invention also provides a kit for production 

of recombinant viral components of at least one of the above- 
identified Ad41 genes, to produce a vaccine to Ad41 or related viruses 
such as Ad40. 

5 The present invention further contemplates the use of 

probes to detect hybridization, cellular DNA from infected tissue 
{e.g. biopsy -material) carrying intergrated structural Ad41 DNA 
(i.e. ^ of the Ad41 long or short fiber protein gene or Ad41 E3 gene) . 
The probe can be DNA, cDNA, recombinant DNA or RNA, The present 
10 invention further contemplates a kit for detection of viral 
components of Ad41 or Ad40. 

In one particular embodiment of the present invention, 
patient specimens (tissue or tissue extracts) contap.ning biopsy 
material are smeared onto a standard microscope slide, then fixed wit); 
13 an appropriate fixative . The DNA or RNA probe, which has been labeled 
(e.g. with biotin-avidin-enzyme) is added. The slide is then placed 
onto a heating block for one or two minutes to allow both the probe 
and the target nucleic acids to be separated from their complementary 
strand (if double stranded) • Non-hybridized probe DNA or RNA is 
20 removed by gentle washing. After a suitable detection complex is 
added, hybridization is detected with a light microscope following 
formation of a colored compound. Alternatively, the probe nucleic 
acid is labeled with a radioactive isotope and tissue to be tested 
lysed and their DNA fixed to, for example, nitrocellulose paper. 
25 Hybridization and DNA/RNA detection systems are well known in the 
art . 

In a further embodiment, the present invention also 
relates to a kit for the detection of Ad41 long and/or short fiber 
protein and its active fragments and fiber protein of related 
30 adenoviruses and/or Ad41 £3 region proteins, the kit being 
compartmentalized to receive a first container adapted 
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1 to contain an antibody having specificity for Ad41 long and/or short 
fiber protein or fragments thereof or Ad41 E3 region proteins, and 
a second container containing an antibody specific f cr first antibody 
and being labeled with a reporter molecule capable of giving a 
5 detectable signal. If the reporter molecule is an enzyme, then a third 
container, containing a substrate for said enzyme is provided. 

In another embodiment, the present invention contem- 
plates pharmaceutical compositions containing at least one of the 
above-identified Ad41 proteins, or derivatives thereof, for 
10 treatment of Ad41 or related viruses such as Ad40. The dosage 
s ■ '^c^ ve amount or such compounds is from about 10 mg to about 100 
mg per kg body weight. 

The DNA sequence comprising the full-lenght, 60.6 kd Ad41 
(Tak) long fiber protein has been deposited with the Exiropeai? 
15 Molecular Biology Laboratory (EMBL) database and accorded the 
accession number X16583. 

The DNA sequence comprising the full-length 41.4 kd Ad41 
short fiber protein has been deposited with the EMBL database and 
accorded the accession number X17016. The Ad41 E3 DNA sequence has 
20 been deposited with the GenBank database an accorded the accession 
number M33160. 

EXAMPLES 

1. r^l 1 Q and virus 

25 - - ' Monolayer cultures of HEp-2, HeLa, Human Intestine 

{HI407), and Graham-293 cell lines were grown in Dulbecco' s modi- 
fication of Eagle minimal essential medium containing 10% fetal 
bovine serum (FBS) . 293 cells were obtained from Flow Laboratories 
as well as ATCC; all other cell lines were from ATCC. The adenovirus 

30 type 41 (Ad41) strain Tak (prototype strain 73-3544 - ATCC #VR-930 
) used was provided by Dr. Jan C. de Jong, Bilthoven, The Netherlands, 
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1 and originally passaged by him in HeLa (pi), Hep-2 (p4) and HeLa (p4) . 
Detailed methods for growth and analysis of Ad41 were performed as 
described in Pieniazek et. al. Virology, 12Ai 239-249 (1990 ) . 

3 2. TcsnlaMon viral DNA 

A modification of the method of Hirt, iT. - Qiol. . ia: 
365-369 (1967) was used. Monolayers of cells, grown in 25 cm2 flasks, 
are inoculated with the virus. After 2 hours the solution was 
discarded and mediiam containing 5% FBS was added. The cultures were 
10 incubated at 37** C for up to 15 days or until maximal CPE could be 
observed. The cells to be analyzed were scraped into the culture fluid 
and centrifuged at 1000 x g for 5 min. The pellet was suspended in 
0.5 ml of 1 X SSPE buffer, pH 7.4, per flask. EDTA and SPS were added 
to the final concentration of 50 mM and 1%, respectively. The lysat^^ 
15 was allowed to stand 20 min. at room temperature, then NaCl was added 
to 1 . 0 M and the sample was incubated at 4"* C for at least 1 hr . The 
high-molecular weight DNA and cell debris was pelleted by spinning 
the lysate for 15 min. in an Eppendorf centrifuge. Tl RNase was added 
to the clarified supernatant to a final concentration of 25 ug/ml. 
20 After incubation for 30 min. at 37^ C proteinase K (Boehringer- 
Mannheim) was added to 200 um/ml and the sanple was further incubated 
for 30 min. as above. The proteins were removed by one extraction 
with saturated phenol and one by phenol/ chloroform mixture (1:1 v/ 
v) according to the method of Maniatis et al., MoT ftcular Cloning; 
25 A t.g»lv Manual . Cold Spring Harbor, NY (1982) . DNA was precipitated 
with 3 volumes of ethanol. Nucleic acid, prepared from one culture 
flask was suspended in 50 ul of TE buffer (10 mm Tris-HCl^ 1 mM EDTA, 
pH 7; 5) and stored at 4"* C. 
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1 3. r Inning Ad41 F.rnRT band B, 

Restriction enzyme EcoRI was ptirchased from BRL and is 
used according to manufacturer's specifications. Briefly, 3 ul of 
sample was digested at 37** C with 5 units of enzyme in a final volume 
5 of 10 ul. Nucleic acid fragments were separated by electrophoresis 
on 1% agarose gels (3ioRad) and the EcoRI band was identified by 
ethidium bromide staining. An agarose fragment containing this band 
was excised from the gel and the DNA was recovered using the GENECLEAN 
kit (Bio 101, La Jolla, CA.) . This isolated DNA fragment was mixed 
10 withEcoR-digestedplasmidpBluescipt II SK(+) (Stratagene, La Jolla, 
CA.) and treated with phage T4 DNA ligase (BRL) , Next, competent cells 
of Z. coli strain XL-1 Blue (Stratagene) were transformed with this 
ligation mixture and a clone containing Ad41 EcoRI band E was selected 
by estimating the size of the insert and restriction enzyme mappings. 

15 

Preliminary sequencing was accomplished using the method 
Deininger, i^r^^^yir. aiochem-.. 135; 247-263 (1983). Ad41 EcoRI band 
B was isolated from an agarose gel as above and sheered by sonication. 

20 The ends of the sheered fragments were then filled with T4 DNA 
polymerase and the fragments were cloned into the Smal site of the 
M13n5>18 phage vector. Individual M13 clones were sequenced using the 
Sequenase )cit from USB (Cleveland, OH) . DNA sequences were analyzed 
using the IBI/Pustell software package from IBI (New Haven, CT) and 

25 their Gel Reader sonic digitizer. 

After locating the start and end of the fiber gene by 
homology to the published Ad5 fiber sequence (Chroboczek and Jacrot, 
Viroloay. 549-554, 1987), Ad41 long fiber gene was sequenced 

using a modified approach. Custom 
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1 oligonucleotide primers were used in a double-stranded DNA sequencing 
protocol according to the Sequenase Version 2.0 manual (in the 
Sequenase kit from USB, Clevelans, OH) . The complete secjuence of the 
Smal - EcoRI fragment (map position 86.4% to 92%), shown in Fig. 1, 
5 was assemble from fragment obtained by sequencing both strands 
including sequencing in the presence of dITP to resolve problems with 
compressions- of the DNA. 

The same method as described above was utilized for 
sequencing the Ad41 short fiber gene^ and the complete sequence of 
10 the EcoRv-AccI fragment (map position 83.1% to 87.1%) is shown in 
Fig. 2. The Ad41 £3 region DNA was also sequenced in similar fashion, 
and the complete sequence of this EcoRT-EspI fragment (map position 
74% to 83.9%) is shovm in Fig. 4. 

The protein coding regions of E3 DNA, short fiber DNA and 
15 long fdlber DNA of human adenovirus type 41 Tak, are shown by the 
proteins RL-1 to RL-6, short fiber protein and long fiber protein 
as illustrated in the map of Fig. 11. 

20 
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1 1. An isolated nucleic acid encoding a protein selected from 

human adenovirus type 41 Tak long fiber protein, short fiber protein, £3 
RL-1 protein, E3 RL-2 protein, E3 RL-3 protein, E3 RL-4 protein, E3 RL- 
5 protein or E3 RL-6 protein. 

5 2. The nucleic acid of Claim 1 wherein said nucleic acid 

is DNA, cDna, recombinant DNA or RNA. 

3 . The nucleic acid according to Claim 1 or 2 having a 
nucleotide sequence of the human adenovirus Type 41 Tak long fiber 
protein gene and which comprises: 

10 

ATG AAA CGA GCC AGA CTT GAA GAT GAC TTC AAC CCC 
TAC TTT GCT CGG TCT GAA CTT CTA CTG AAG TTG GGG 

GTC TAC CCT TAC GAA CAC TAC AAT CCC CTT GAC ATC CCA TTT ATT ACA CCC [ ' 
15 CAG ATG GGA ATG CTT GTG ATG TTA GGG GAA CTG TAG GGT AAA TAA TGT GGG 

CCG TTT GCC TCC TCC AAC GGC TTG CAA GAA AAA CCA CCG GGA GTC CTC AGC 
GGC AAA CGG AGG AGG TTG CCG AAC GTT CTT TTT GGT GGC CCT CAG GAG TCG 

20 CTG AAA TAC ACT GAT CCA CTT ACA ACC AAA AAC GGG GCT TTA ACC TTA AAA 
GAC TTT ATG TGA CTA GAA GTA TGT TGG TTT TTG CCC CGA AAT TGG AAT TTT 

CTG GGC ACG GGA CTA AAC ATT GAT GAA AAT GGA GAT CTT TCT TCA GAT GCT 

GAC CCG TGC CCT GAT TTG TAA CTA CTT TTA CCT CTA GAA AGA AGT CTA CGA 

25 

AGC GTG GAA GTT AGC GCC CCT ATT ACT AAA ACC AAC AAA ATC GTA GGT TTA 

TCG CAC CTT CAA TCG CGG GGA TAA TGA TTT TGG TTG TTT TAG CAT CCA AAT 

AAT TAC ACT AAA CCT CTC GCC CTG CGA AGT AAC GCG CTC ACT CTT TCT TAC 
30 TTA ATG TGA TTT GGA GAG CGG GAC GCT TCA TTG CGC GAG TGA GAA AGA ATG 

AAC GCA CCC TTA ACC GTA GTA AAT AAC AAT TTA GCT TTA AAT ATC TCA CAA 
TTG CGT GGG AAT TTG CAT CAT TTA TTG TTA AAT CGA AAT TTA TAG AGT GTT 

35 CCT GTC ACT GTT AAT GCA AAC AAC GAA CTT TCT CTC TTA ATA- GAC GCC CCA 
GGA CAG TGA CAA TTA CGT TTG TTG CTT GAA AGA GAG AAT TAT CTG CGG GGT 
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1 CTT AAT GCT GAC ACG GGC ACT CTT CGC CTT CAA AGT GCT GCA COT CTT GGA 
GAA TTA CGA CTG TGC CCG TGA GAA GCG GAA GTT TCA CGA CGT GGA GAA OCT 

CTA GTG GAC AAA ACA CTA AAA GTT TTG TTT TCT AGO CCC CTC TAT CTA GAT 
5 GAT CAC CTG TTT TGT GAT TTT CAA AAC AAA AGA TCG GGG GAG ATA GAT CTA 

AAT AAC TTT CTT ACA CTA GCC ATT GAA CGC CCG CTA GCT CTA TCC AGT AGC 
TTA TTG AAA GAA TGT GAT CGG TAA CTT GCG GGC GAT CGA GAT AGG TCA TCG 

10. AGA GCA GTG ACC CTT AAG TAT TCA CCA CCT TTA AAA ATA GAA AAC aAA AAC 
TCT CGT CAC TGG GAA TTC ATA AGT GGT GGA AAT TTT TAT CTT TTG CTT TTG 

TTA ACC CTA AGC ACA GGC GGG CCT TTT ACT GTA AGC GGG GGA ^ AAT CTA AAC 

AAT TGG GAT TCG TGT CCG CCC GGA AAA TGA CAT TCG CCC CCT TTA GAT TTG i 

15 

TTA ACA ACA TCG GCA CCT CTC TCC GTG CAA AAC AAC TCT CTC TCC TTA GTC 

AAT TGT TGT AGC CGT GGA GAG AGG CAC GTT TTG TTG AGA GAG AGG AAT CAG 

ATT ACT TCT CCT TTA AAA GTT ATT AAT TCT ATG TTA GCC GTT GGG GTT AAC 
20 TAA TGA AGA GGA AAT TTT CAA TAA TTA AGA TAC AAT CGG CAA CCC CAA TTG 

CCG CCT TTT ACC ATC ACT GAC TCT GGA TTA GCT ATG GAC TTA GGA GAC GGT 
GGC GGA AAA TGG TAG TGA CTG AGA CCT AAT CGA TAC CTG AAT CCT CTG CCA 

25 CTT GOA-. CTA GGT GGC TCT AAG TTA ATA ATC AAT CTT GGT CCA GGT TTA CAA 
GAA CGT GAT CCA CCG AGA TTC AAT TAT TAG TTA GAA CCA GGT CCA AAT GTT 

ATG TCT- AAT GGA GCT ATT ACT TTA GCA CTA GAT GCA GCG CTG CCT TTG CAA 

TAC AGA TTA CCT CGA TAA TGA AAT CGT GAT CTA CGT CGC GAC GGA AAC GTT 

30 

TAT AGA GAC AAC CAA CTT CAA CTC AGA ATT GGC TCA ACA TCT GGC TTA ATT 

ATA TCT CTG TTG GTT GAA GTT GAG TCT TAA CCG AGT TGT AGA CCG AAT TAA 

ATG AGC GGA GTA ACA CAA ACA TTA AAC GTC AAT GCC AAT ACC GGC AAA GGT 
35 TAG TCG CCT CAT TGT GTT TGT AAT TTG CAG TTA CGG TTA TGG CCG TTT CCA 

CTT GCT GTT GAA AAC AAC TCA CTA GTT GTT AAG CTT GGG AAC GGT CTT CGC 
GAA CGA CAA CTT TTG TTG AGT GAT CAA CAA TTC GAA CCC TTG CCA GAA GCG 
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1 TTT GAT AGC TGG GGA AGC ATA ACT GTC TCG CCT ACT ACC ACT ACC CCT ACC 
AAA CTA TCG ACC CCT TCG TAT TGA CAG AGC GGA TGA TGG TGA TGG GGA TGG 

ACC CTA TGG ACC ACC GCA GAC CCA TCA CCT AAC GCC ACT TTT TAT GAA TCA 
5 TGG GAT ACC TGG TGG CGT CTG GGT AGT GGA TTG CGG TGA AAA ATA CTT AGT 

CTA GAC GCC AAA GTG TGG CTA GTT TTA GTA AAA TGC AAC GGC ATG GTT AAC 
GAT CTG CGG TTT CAC ACC GAT CAA AAT CAT TTT ACG TTG CCG TAC CAA TTG 

10 GGG ACC ATA TCC ATT AAA GCT CAG AAA GGC ATT TTA CTT AGA CCT ACA GCT 
CCC TGG TAT AGG TAA TTT CGA GTC TTT. CCG TAA AAT GAA TCT GGA TGT CGA 

• 

AGT TTT ATT TCC TTT GTC ATG TAT TTC TAC AGC GAT GGA ACA. TGG AGA AAA 
TCA AAA TAA AGG AAA CAG TAC ATA AAG ATG TCG CTA CCT TGT ACC TCT TTT * 

AAC TAT CCC GTG TTT GAC AAC GAA GGG ATA CTA GCA AAC AGT GCC ACG TGG 
TTG ATA GGG CAC AAA CTG TTG CTT CCC TAT GAT CGT TTG TCA CGG TGC ACC 

GGT TAT CGA CAA GGA CAG TCT GCC AAC ACT AAC GTT TCT AAT GCT GTA GAA 
20 CCA ATA GCT GTT CCT GTC AGA CGG TTG TGA TTG CAA AGA TTA CGA CAT CTT 

TTT ATG CCT AGC TCT AAA AGA TAT CCC AAT CAA AAA GGT TCT GAA GTT CAG 
AAA TAC GGA TCG AGA TTT TCT ATA GGG TTA GTT TTT CCA AGA CTT CAA GTC 

25 AAC ATG- "GCT CTT ACC TAC ACT TTT TTG CAA GGT GAT CCT AAC ATG GCC ATA 
TTG TAC CGA GAA TGG ATG TGA AAA AAC GTT CCA CTA GGA TTG TAC CGG TAT 

TCC TTT CAG AGT ATT TAT AAT CAT GCA TTA GAA GGC TAC TCA TTA AAA TTT 
AGG AAA GTC TCA TAA ATA TTA GTA CGT AAT CTT CCG ATG AGT AAT TTT AAA 

30 

ACC TGG CGC GTT CGA AAT AAT GAA CGT TTT GAC ATC CCC TGC TGC TCA TTT 
TGG ACC GCG CAA GCT TTA TTA CTT GCA AAA CTG TAG GGG ACG ACG AGT AAA 

TCT TAT GTA ACA GAA CAA TAA A 
35 AGA ATA CAT TGT CTT GTT ATT T 
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1 4 . The nucleic acid according to Claims 1 or 2 having a 

nucleotide sequence of the human adenovirus Type 41 long fiber protein 
gene which comprises: 

CCCGGGCAAC ATGCTCATCC AAATCTCGCC TAACATCACC TTCAGTGTCG TCTACAACGA 
5 GGGCCCGTTG TACGAGTAGG TTTAGAGCGG ATTGTAGTGG AAGTCACAGC AGATGTTGCT 

GATAAACAGT GGGTATGCTT TTACTTTTAA ATGGTCAGCC GAACCGGGAA AACCTTTTCA 
CTATTTGTCA CCCATACGAA AATGAAATT TACCAGTCGG CTTGGCCCTT TTGGAAAAGT 

10 CCCACCTACC GCTGTATTTT GCTACATAAC TGAACAATAA AATCATTGCA GGCACAATCT 
GGGTGGATGG CGACATAAAA CGATGTATPG ACTTGTTATT TTAGTAACGT CCGTGTTAGA 

• 

TCGCATTTCT TTTTTTCCAG ATG AAA CGA GCC AGA CTT GAA GAT* GAC TTC AAC CCC 
AGCGTAAAGA AAAAAAGGTC TAG TTT GOT CGG TOT GAA CTT CTA CTG AAG TTG 6gG 

15 

GTC TAC CCT TAC GAA CAC TAC AAT CCC CTT GAC ATC CCA TTT ATT ACA CCC 
CAG ATG GGA ATG CTT GTG ATG TTA GGG GAA CTG TAG GGT AAA TAA TGT GGG 

CCG TTT GCC TCC TCC AAC GGC TTG CAA GAA AAA CCA CCG GGA GTC CTC AGC 
20 GGC AAA CGG AGG AGG TTG CCG AAC GTT CTT TTT GGT GGC CCT CAG GAG TCG 

CTG AAi^ TAC ACT GAT CCA CTT ACA ACC AAA AAC GGG GCT TTA ACC TTA AAA 
GAC TTT ATG TGA CTA GGT GAA TGT TGG TTT TTG CCC CGA AAT TGG AAT TTT 

25 CTG GGC ' ACG GGA CTA AAC ATT GAT GAA AAT GGA GAT CTT TCT TCA GAT GCT 
GAC CCG TGC CCT GAT TTG TAA CTA CTT TTA CCT CTA GAA AGA AGT CTA CGA 

AGC GTG GAA GTT AGC GCC CCT ATT ACT AAA ACC AAC AAA ATC GTA GGT TTA 
TCG CAC CTT CAA TCG CGG GGA TAA TGA TTT TGG TTG TTT TAG CAT CCA AAT 



30 



AAT TAC ACT AAA CCT CTC GCC CTG CGA AGT AAC GCG CTC ACT CTT TCT TAC 
TTA ATG TGA TTT GGA GAG CGG GAC GCT TCA TTG CGC GAG TGA GAA AGA ATG 



AAC GCA CCC TTA AAC GTA GTA AAT AAC AAT TTA GCT TTA AAT ATC TCA CAA 
35 TTG CGT GGG AAT TTG CAT CAT TTA TTG TTA AAT CGA AAT TTA TAG AGT GTT 



CCT GTC ACT GTT AAT GCA AAC AAC GAA CTT TCT CTC TTA ATA GAC GCC CCA 
GGA CAG TGA CAA TTA CGT TTG TTG CTT GAA AGA GAG AAT TAT CTG CGG GGT 
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1 CTT AAT GCT GAC ACG GGC ACT CTT CGC CTT CAA AGT GCT GCA CCT CTT GGA 
GAA TTA CGA CTG TGC CCG TGA GAA GCG GAA GTT TCA CGA CGT GGA GAA CCT 

CTA GTG GAC AAA ACA CTA AAA GTT TTG TTT TCT AGC CCC CTC TAT CTA GAT 
5 GAT CAC CTG TTT TGT GAT TTT CAA AAC AAA AGA TCG GGG GAG ATA GAT CTA 

AAT AAC TTT CTT ACA CTA GCC ATT GAA CGC CCG CTA GTC CTA TCC AGT AGC 
TTA TTG AAA GAA TGT GAT CGG TAA CTT GCG GGC GAT CGA GAT AGO TCA TCG 

10 AGA GCA GTG ACC CTT AAG TAT TCA CCA CCT TTA AAA ATA GAA AAC GAA AAC 
TGT GGT CAC TGG GAA TTC ATA AGT GGT GGA AAT TTT TAT CTT TTG CTT TTG 

TTA ACC CTA AGC ACA GGC GGG CCT TTT ACT GTA AGC GGG GGA AAT CTA AAC 

AAT TGG GAT TCG TGT CCG CCC GGA AAA TGA CAT TCG CCC CCT TTA GAT TTG i 

15 

ATT ACT TCT CCT TTA AAA GTT ATT AAT TCT ATG TTA GCC GTT GGG GTT AAC 

TAA TGA AGA GGA AAT TTT CAA TAA TTA AGA TAC AAT CGG CAA CCC CAA TTG 

CCG CCT TTT ACC ATC ACT GAC TCT GGA TTA GCT ATG GAC TTA GGA GAC GGT 
20 GGC GGA AAA TGG TAG TGA CTG AGA CCT AAT CGA TAC CTG AAT CCT CTG CCA 

CTT GCA CTA GGT GGC TCT AAG TTA ATA ATC AAT CTT GGT CCA GGT TTA CAA 
GAA CGT GAT CCA CCG AGA TTC AAT TAT TAG TTA GAA CCA GGT CCA AAT GTT 

25 ATG TCT* AAT GGA GCT ATT ACT TTA GCA CTA GAT GCA GCG CTG CCT TTG CAA 
TAC AGA TTA CCT CGA TAA TGA AAT CGT GAT CTA CGT CGC GAC GGA AAC GTT 

TAT AGA GAC AAC CAA CTT CAA CTC AGA ATT GGC TCA ACA TCT GGC TTA ATT 

ATA TCT CTG TTG GTT GAA GTT GAG TCT TAA CCG AGT TGT AGA CCG ATT TAA 

30 

ATG AGC GGA GTA ACA CAA ACA TTA AAC GTC AAT GCC AAT ACC GGC AAA GGT 

TAC TCG CCT CAT TGT GTT TGT AAT TTG CAG TTA CGG TTA TGG CCG TTT CCA 

CTT GCT GTT GAA AAC AAC TCA CTA GTT GTT AAG CTT GGG AAC GGT CTT CGC 
35 GAA CGA CAA CTT TTG TTG AGT GAT CAA CAA TTC GAA CCC TTG CCA GAA GCG 

TTT GAT AGC TGG GGA AGC ATA ACT GTC TCG CCT ACT ACC ACT ACC CCT ACC 
AAA CTA TCG ACC CCT TCG TAT TGA CAG AGC GGA TGA TGG TGA TGG GGA TGG 
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5 



ACC CTA TGG ACC ACC GCA GAC CCA TCA CCT AAC GCC ACT TTT TAT GAA TCA 
TGG GAT ACC TGG TGG CGT CTG GGT AGT GGA TTG CGG TGA AAA ATA CTT ACT 

CTA GAC GCC AAA GTG TGG CTA GTT TTA GTA AAA TGG AAC GGC ATG GTT AAC 
GAT CTG CGG TTT CAC ACC GAT CAA AAT CAT TTT ACG TTG CCG TAC CAA TTG 

GGG ACC. ATA TCC ATT AAA GCT CAG AAA GGC ATT TTA CTT AGA CCT ACA GCT 
CCC TGG TAT AGG TAA TTT CGA GTC TTT CCG TAA AAT GAA TCT GGA TGT CGA 



10 AGT TTT ATT TCC TTT GTC ATG TAT TTC TAC AGC GAT GGA ACA TGG AGA AAA 
TCA AAA TAA AGG AAA CAG TAC ATA AAG ATG TCG CTA CCT TGT ACC TCT TTT 

AAC TAT CCC GTG TTT GAC AAC GAA GGG ATA CTA GCA AAC AGT GCC ACG TGG 
TTG ATA GGG CAC AAA CTG TTG CTT CCC TAT GAT CGT TTG TCA CGG TGC ACC 



15 



20 



25 



30 



35 



GGT TAT CGA CAA GGA CAG TCT GCC AAC ACT AAC GTT TCT AAT GCT GTA GAA 
CCA ATA GCT GTT CCT GTC AGA CGG TTG TGA TTG CAA AGA TTA CGA CAT CTT 

TTT ATG CCT AGC TCT AAA AGA TAT CCC AAT CAA AAA GGT TCT GAA GTT CAG 
AAA TAC GGA TCG AGA TTT TCT ATA GGG TTA GTT TTT CCA AGA CTT CAA GTC 

AAC ATG GCT CTT ACC TAC ACT TTT TTG CAA GGT GAT CCT AAC ATG GCC ATA 
TTG TAC CGA GAA TGG ATG TGA AAA AAC GTT CCA CTA GGA TTG TAC CGG TAT 

TCC TTT. CAG AGT ATT TAT AAT CAT GCA TTA GAA GGC TAC TCA TTA AAA TTT 
AGG AAA GTC TCA TAA ATA TTA GTA CGT AAT CTT CCG ATG AGT AAT TTT AAA 

ACC TGG CGC GTT CGA AAT AAT GAA CGT TTT GAC ATC CCC TGC TGC TCA TTT 
. TGG ACC GCG CAA GCT TTA TTA CTT GCA AAA CTG TAG GGG ACG ACG AGT PJJi . 

TCT TAT GTA ACA GAA CAA TAA A ATATTGTTGT TTTTGTTTTT ATAACTTTAT 
AGA ATA CAT TGT CTT GTT ATT T TATAACAACA AAAACAAAAA TATTGAAATA 

TGATACTTTT ACAGAATTC . 
ACTATGAAAA TGTCTTAAG 
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1 5. The nucleic acid according to Claim 1 or 2 having a 

nucleotide sequence encoding an amino acid sequence for Ad41 long 
fiber protein which comprises: 

Met Lys Arg Ala Arg Leu Glu Asp Asp ?he Asn Pro 
5 Val 7yr Pro Tyr Glu His Tyr Asn Pro Leu Asp He Pro Phe lie Thr Pro 
Pro Phe Ala Ser Ser Asn Gly Leu Gin Glu Lys Pro Pro Gly Val Leu Ser 
Leu Gly Thr Gly Leu Asn lie Asp Glu Asn Gly Asp Leu Ser Ser Asp Ala 
Ser Val Glu Val Ser Ala Pro He Thr Lys Thr Asn Lys lie Val Gly Leu 
Asn Tyr Thr Lys Pro Leu Ala Leu. Arg Ser Asn Ala Leu Thr Leu Ser Tyr 
10 Asn Ala Pro Leu Asn Val Val Asn Asn Asn Leu Ala Leu Asn He Ser Gin 
Pro Val Thr Val Asn Ala Asn Asn Glu Leu Ser Leu Leu He Asp Ala Pro 
Leu Asn Ala Asp Thr Gly Thr Leu Arg Leu Gin Ser Ala Ala Pro Leu Gly 
Leu Val Asp Lys Thr Leu Lys Val Leu Phe Ser Ser Pro Leu Tyr Leu Asp 
Asn Asn Phe Leu Thr Leu Ala He Glu Arg Pro Leu Ala Leu Ser Ser Ser i 
15 Arg Ala Val Thr Leu Lys Tyr Ser Pro Pro Leu Lys He Glu Asn Glu Asn Leu 
Thr Leu Ser Thr Gly Gly Pro Phe Thr Val Ser Gly Gly Asn Leu Asn Leu Thr 
Thr Ser Ala Pro Leu Ser Val Gin Asn Asn Ser Leu Ser Leu Val He Thr Ser 
Pro Leu Lys Val He Asn Ser Met Leu Ala Val Gly Val Asn Pro Pro Phe Thr 
He Thr Asp Ser Gly Leu Ala Met Asp Leu Gly Asp Gly Leu Ala Leu Gly .Gly 
20 S.er Lys Leu He He Asn Leu Gly Pro Gly Leu Gin Met Ser Asn Gly Ala He 
Thr Leu Ala Leu, Asp Ala Ala Leu Pro Leu Gin Tyr Arg Asp Asn Gin Leu Gin 
Leu Arg He Gly Ser Thr Ser Gly Leu He Met Ser Gly Val Thr Gin Thr Leu 
Asn Val Asn Ala Asn Thr Gly Lys Gly Leu Ala Val Glu Asn Asn Ser Leu Val 
Val Lys Leu Gly Asn Gly Leu Arg Phe Asp Ser Trp Gly Ser He Thr Val Ser 
25 Pro Thr Thr Thr- Thr Pro Thr Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn 
Ala Thr Phe Tyr Glu Ser Leu Asp Ala Lys Val Trp Leu Val Leu Val Lys Cys 
Asn Gly Met Val Asn Gly Thr He Ser He Lys Ala Gin Lys Gly He Leu Leu 
Arg Pro Thr Ala Ser Phe He Ser Phe Val Met Tyr Phe Tyr Ser Asp Gly Thr 
Trp Arg Lys Asn Tyr Pro Val Phe Asp Asn Glu Gly He Leu Ala Asn Ser Ala 
30 Thr Trp Gly Tyr Arg Gin Gly Gin Ser Ala Asn Thr Asn Val Ser Asn Ala Val 
Glu Phe Met Pro Ser Ser Lys Arg Tyr Pro Asn Gin Lys Gly Ser Gl^: Val Gin 
Asn Met Ala Leu Thr Tyr Thr Phe Leu Gin Gly Asp Pro Asn Met A. a He Ser 
Phe Gin Ser He Tyr Asn His Ala Leu Glu Gly Tyr Ser Leu Lys Phe Thr Trp 
Arg Val Arg Asn Asn Glu Arg Phe Asp He Pro Cys Cys Ser Phe Ser Tyr Val 
35 Thr Glu Gin 
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1 6 . The nucleic acid according to Claim 1 or 2 having 

a nucleotide sequence o£ human adenovirus type 41 short fiber 
protein which comprises: 

ATG AAA AGA ACC AGA ATT 
5 TAG TTT TCT TGG TOT TAA 

GAA GAG GAG TTG AAC CCC GTC TAG CCC TAT GAG ACC TTG TCA ACT GGC 
CTT GTG CTG AAG TTG GGG GAG ATG GGG ATA CTG TGG AAG AGT TGA GGG 

10 AGC ATC CCC TAT GTA GCT CCG CCC TTC GTT TCT TCT GAC GGG TTA GAG 
TCG TAG GGG ATA CAT CGA GGC GGG AAG CAA AGA AGA CTG CCC AAT GTC 

GAA AAA CCC CCA GGA GTT TTA GCA CTG AAG TAC ACT GAC CCC ATT ACT 
CTT TTT GGG GGT CCT CAA AAT CGT GAG TTC ATG TGA CTG GGG TAA TGA 

15 

ACC AAT GCT AAG CAT GAG CTT ACT TTA AAA CTT GGA AGC AAC ATA ACT 
TGG TTA CGA TTC GTA CTC GAA TGA AAT TTT GAA CCT TCG TTG TAT TGA 

TTA GAA AAT GGG TTA CTT TCG GGC ACA GTT CCC ACT GTT TCT CCT CCC 
20 AAT CTT TTA CCC AAT GAA AGC GGG TGT CAA GGG TGA CAA AGA GGA GGG 

CTT ACA AAC AGT AAC AAC TCC CTG GGT TTA GCC ACA TCC GCT CCC ATA 
GAA TGT TTG TCA TTG TTG AGG GAC CCA AAT GGG TGT AGG CGA GGG TAT 

25 GCT GTA TCA GCT AAC TCT CTC ACA TTG GCC ACC GCC GCA CCA CTG ACA 
CGA CAT AGT CGA TTG AGA GAG TGT AAC GGG TGG GGG CGT GGT GAC TGT 

GTA AGC AAC AAC CAG CTT AGT ATT AAC GCG GGC AGA GGT TTA GTT ATA 

CAT TCG TTG TTG GTC GAA TCA TAA TTG CGC CCG TCT CCA AAT CAA TAT 

30 

ACT AAC AAT GCC TTA ACA GTT AAT CCT ACC GGA GCG CTA GGT TTC AAT 

TGA TTG TTA CGG AAT TGT CAA TTA GGA TGG CCT CGC GAT CCA AAG TTA 

AAC ACA GGA GCT TTA CAA TTA AAT GCT GCA GGA GGA ATG AGA GTG GAC 
35 TTG TGT CCT CGA AAT GTT AAT TTA CGA CGT CCT CCT TAC TCT CAC CTG 

GGT GCC AAC TTA ATT CTT CAT GTA GCA TAA CCC TTT GAA GCA ATC AAC 
CCA CGG TTG AAT TAA GAA GTA CAT CGT ATA GGG AAA CTT CGT TAG TTG 
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1 CAG CTA ACA CTG CGA TTA GAA AAC GGG TTA GAA GTA ACC AGC GGA GGA 
GTC GAT TGT GAG GOT AAT CTT TTG CCC AAT CTT CAT TGG TOG CCT CCT 

AAG CTT AAC GTT AAG TTG GGA TCA GGC CTC CAA TTT GAC AGT AAC GGA 
5 TTC GAA TTG CAA TTC AAC CCT AGT CCG GRG GTT AAA CTG TCA TTG CCT 

CGC ATT GCT ATT AGT AAT AGC AAC CGA ACT CGA AGT GTA CCA TCC CTC 
GCG TAA CGA TAA TCA TTA TCG TTG GCT TGA GCT TCA CAT GGT AGG GAG 

10 ACT ACC ATT TGG TCT ATC TCG CCT ACG CCT AAC TGC TCC ATT TAT GAA 
TGA TGG TAA ACC AGA TAG AGC GGA TGC GGA TTG ACG AGG TAA ATA CTT 

ACC CAA GAT GCA AAC CTA TTT CTT TGT CTA ACT AAA AAC GGA GCT CAC 

« 

TGG GTT CTA CGT TTG GAT AAA GAA ACA GAT TGA TTT TTG CCT CGA GTG 

15 

GTA TTA GGT ACT ATA ACA ATC AAA GGT CTT AAA GGA GCA CTG CGG GAA 
CAT AAT CCA TGA TAT TGT TAG TTT CCA GAA TTT CCT CGT GAC GCC CTT 

ATG CAC GAT AAC GCT CTA TCT TTA AAA CTT CCC TTT GAC AAT CAG GGA 
20 TAC GTG CTA TTG CGA GAT AGA AAT TTT GAA GGG AAA CTG TTA GTC CCT 

AAT TTA CTT AAC TGT GCC TTG GAA TCA TCC ACC TGG CGT TAC CAG GAA 
TTA AAT GAA TTG ACA CGG AAC CTT AGT AGG TGG ACC GCA ATG GTC CTT 

25 ACC AAC- GCA GTG GCC TCT AAT GCC TTA ACA TTT ATG CCC AAC AGT ACA 
TGG TTG CGT CAC CGG AGA TTA CGG AAT TGT AAA TAC GGG TTG TCA TGT 

GTG TAT CCA CGA AAC AAA ACC GCT CAC CCG GGC AAC ATG CTC ATC CAA 

CAC ATA GGT GCT TTG TTT TGG CGA GTG GGC CCG TTG TAC GAG TAG GTT 

30 

ATC TCG CCT AAC ATC ACC TTC AGT GTC GTC TAC AAC GAG ATA AAC AGT 

TAG AGC GGA TTG TAG TGG AAG TCA CAG CAG ATG TTG CTC TAT TTG TCA 

GGG TAT GCT TTT ACT TTT AAA TGG TCA GCC GAA CCG GGA AAA CCT TTT 
35 CCC ATA CGA AAA TGA AAA TTT ACC AGT CGG CTT GGC CCT TTT GGA AAA 

CAC CCA CCT ACC GCT GTA TTT TGC TAC ATA ACT GAA CAA lAA 
GTG GGT GGA TGG CGA CAT AAA ACG ATG TAT TGA CTT GTT ATT 
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1 7 . The nucleic acid according to Claim 1 or 2 having a 

nucleotide sequence of human adenovirus type 41 Tak short fiber protein 
which comprises : 

GATATCAGTT GTTTGTCAAG TTTTTCCAGC AGCACCACCT GCCCTTCCTC CCAACTTTCG 
5 CTATAGTCAA CAAACAGTTC AAAAAGGTCG TCGTGGTGGA CGGGAAGGAG GGTTGAAAGC 

TAGGGGATGT GCCAACGGGC AGCAAACTTT CTCCACGTCC TAAAGGGTAT ATCGGTGTTC 
ATCCCCTACA CGGTTGCCCG TCGTTTGAAA GAGGTGCAGG ATTTCCCATA TAGCCACAAG 

10 ACCTTTTTAC CCTGACCCAC GATCTTCATC TTGCAG ATG AAA AGA ACC AGA ATT 
TGGAAAAATG GGACTGGGTG CTAGAAGTAG AACGTC TAG TTT TCT TGG TCT TAA 



15 



GAA GAG GAG TTC AAC CCC GTC TAG CCC TAT GAG ACC TTC TCA ACT CCC 

CTT CTG CTG AAG TTG G6G CAG ATG GGG ATA CTG TGG AAG AGt'tGA GGG 

AGC ATC CCC TAT GTA GCT CCG CCC TTC GTT TCT TCT GAG GGG TTA CAG 

TGG TAG GGG ATA CAT CGA GGC GGG AAG CAA AGA AGA CTG CCC AAT GTC 



20 



GAA AAA CCC CCA GGA GTT TTA GCA CTC AAG TAG ACT GAG CCC ATT ACT 
CTT TTT GGG GGT CCT CAA AAT CGT GAG TTC ATG TGA CTG GGG TAA TGA 



ACC AAT GCT AAG CAT GAG CTT ACT TTA AAA CTT GGA AGC AAC ATA ACT 
TGG TTA CGA TTC GTA CTC GAA TGA AAT TTT GAA CCT TGG TTG TAT TGA 



25 TTA GAA AAT GGG TTA CTT TCG GCC ACA GTT CCC ACT GTT TCT CCT CCC 
AAT CTT TTA CCC AAT GAA AGC CGG TGT CAA GGG TGA CAA AGA GGA GGG 



CTT ACA AAC AGT AAC AAC TCC CTG GGT TTA GCC ACA TCC GCT CCC ATA 

GAA TGT TTG TCA TTG TTG AGG GAG CCA AAT CGG TGT AGG CGA GGG TAT 

30 

GCT GTA TCA GCT AAC TCT CTC ACA TTG GCC ACC GCC GCA CCA CTG ACA 

CGA CAT AGT CGA TTG AGA GAG TGT AAC CGG TGG CGG CGT GGT GAG TGT 

GTA AGC AAC AAC CAG CTT AGT ATT AAC GCG GGC AGA GGT TTA GTT ATA 

35 CAT TCG TTG TTG GTC GAA TCA TAA TTG CGC CCG TCT CCA AAT CAA TAT 



ACT AAC AAT GCC TTA ACA GTT AAT CCT ACC GGA GCG CTA GGT TTC AAT 
TGA TTG TTA CGG AAT TGT CAA TTA GGA TGG CCT CGC GAT CCA AAG TTA 
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1 AAC ACA GGA GCT TTA CAA TTA AAT GCT GCA GGA GGA ATG AGA GTG GAC 
TTG TGT CCT CGA AAT GTT AAT TTA CGA CGT OCT CCT TAG TCT CAC GTG 

GGT GCC AAC TTA ATT CTT CAT GTA GCA TAT CCC TTT GAA GCA ATC AAC 
5 CCA CGG TTG AAT TAA GAA GTA CAT CGT ATA GOG AAA CTT CGT TAG TTG 

CAG CTA ACA CTG CGA TTA GAA AAC GGG TTA GAA GTA ACC AGC GGA GGA 
GTC GAT TGT GAC GCT AAT CTT TTG CCC AAT CTT CAT TGG TCG CCT CCT 

10 AAG CTT AAC GTT AAG TTG GGA TCA GGC CTC CAA TTT GAC AGT AAC GGA 
TTC GAA TTG CAA TTC AAC CCT AGT CCG GAG GTT AAA CTG TCA TTG CCT 

CGC ATT GCT ATT AGT AAT AGC AAC CGA ACT CGA AGT GTA CCA. TCC CTC 

GCG TAA CGA TAA TCA TTA TCG TTG GCT TGA GCT TCA CAT GGT AGG GAG ( 

15 

ACT ACC ATT TGG TCT ATC TCG CCT ACG CCT AAC TGC TCC ATT TAT GAA 

TGA TGG TAA ACC AGA TAG AGC GGA TGC GGA TTG ACG AGG TAA ATA CTT 

ACC CAA GAT GCA AAC CTA TTT CTT TGT CTA ACT AAA AAC GGA GCT CAC 
20 TGG GTT CTA CGT TTG GAT AAA GAA ACA GAT TGA TTT TTG CCT CGA GTG 

GTA TTA GGT ACT ATA ACA ATC AAA GGT CTT AAA GGA GCA CTG CGG GAA 
CAT AAT CCA TGA TAT TGT TAG TTT CCA GAA TTT CCT CGT GAC GCC CTT 

25 ATG CAC -GAT AAC GCT CTA TCT TTA AAA CTT CCC TTT GAC AAT CAG GGA 
TAC GTG CTA TTG CGA GAT AGA AAT TTT GAA GGG AAA CTG TTA GTC CCT 

AAT TTA • CTT AAC TGT GCC TTG GAA TCA TCC ACC TGG CGT TAC CAG GAA 
TTA AAT GAA TTG ACA CGG AAC CTT AGT AGG TGG ACC GCA ATG GTC CTT 

30 

ACC AAC GCA GTG GCC TCT AAT GCC TTA ACA TTT ATG CCC AAC AGT ACA 
TGG TTG CGT CAC CGG AGA TTA CGG AAT TGT AAA TAC GGG TTG TCA TGT 

GTG TAT CCA CGA AAC AAA ACC GCT CAC CCG GGC AAC ATG CTC ATC CAA 
35 CAC ATA GGT GCT TTG TTT TGG CGA GTG GGC CCG TTG TAC GAG TAG GTT 
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1 ATC TCG CCT AAC ATC ACC TTC AGT GTC GTC TAG AAC GAG ATA AAC AGT 
TAG AGC GGA TTG TAG TGG AAG TCA GAG GAG ATG TTG GTC TAT TTG TCA 

GGG TAT GCT TTT ACT TTT AAA TGG TCA GCC GAA CCG GGA AAA CCT TTT 
5 CCC ATA GGA AAA TGA AAA -TTT ACC AGT GGG CTT GGC CCT TTT GGA AAA 

CAC CCA CCT ACC GCT GTA TTT TGC TAC ATA ACT GAA CAA TAA 
GTG GGT GGA TGG GGA CAT AAA ACG ATG TAT TGA CTT GTT ATT 

10 AATCATTGCA GGCACAATCT TCGCATTTCT TTTTTTCCAG ATGAAACGAG CCAGACTTGA 

TTAGTAACGT CCGTGTTAGA AGCGTAAAGA AAAAAAGGTC TACTTTGCTC GGTCTGAACT 

AGATGACTTC AACCCCGTCT AC 

TCTACTGAAG TTGGGGCAGA TG 

15 8 . The nucleic acid according to Claim 1 or 2 having a nucleotide sequence 

encoding an amino acid sequence for Ad41 short fiber protein which conprises: 
Met Lys Arg Thr Arg lie Glu Asp Asp Phe Asn Pro Val Tyr Pro Tyr Asp Thr Phe 
Ser Thr Pro Ser lie Pro Tyr Val Ala Pro Pro Phe Val Ser Ser Asp Gly Leu Gin 
Glu Lys Pro Pro Gly Val Leu Ala Leu Lys Tyr Thr Asp Pro lie Thr Thr Asn Ala 
20 Lys His Glu Leu Thr Leu Lys Leu Gly Ser Asn lie Thr Leu Glu Asn Gly Leu Leu 
Ser Ala Thr Val Pro Thr Val Ser Pro Pro Leu Thr Asn Ser Asn Asn Ser Leu Gly 
Leu Ala Thr Ser Ala Pro lie Ala Val Ser Ala Asn Ser Leu Thr Leu Ala Thr Ala 
Ala Pro Leu Thr Val Ser Asn Asn Gin Leu Ser lie Asn Ala Gly Arg Gly Leu Val 
lie Thr Asn Asn Ala Leu Thr Val Asn Pro Thr Gly Ala Leu Gly Phe Asn Asn Thr 
25 Gly -Ala Leu Gin Leu Asn Ala Ala Gly Gly Met Arg Val Asp Gly Ala Asn Leu He 
Leu fiis Val Ala Tyr Pro Phe Glu Ala He Asn Gin Leu Thr Leu Arg Leu Glu Asn 
Gly Leu Glu Val Thr Ser Gly Gly Lys Leu Asn Val Lys Leu Gly Ser Gly Leu Gin 
Phe Asp Ser Asn Gly Arg He Ala He Ser Asn Ser Asn Arg Thr Arg Ser Val Pro 
Ser Leu Thr Thr He Trp Ser He Ser Pro Thr Pro Asn Cys Ser He Tyr Glu Thr 
30 Gin Asp Ala Asn Leu Phe Leu Cys Leu Thr Lys Asn Gly Ala His Val Leu Gly Thr 
He Thr He Lys Gly Leu Lys Gly Ala Leu Arg Glu Met His Asp Asn Ala Leu Ser 
Leu Lys Leu Pro Phe Asp Asn Gin Gly Asn Leu Leu Asn Cys Ala Leu Glu Ser Ser 
Thr Trp Arg Tyr Gin Glu Thr Asn Ala Val Ala Ser Asn Ala Leu Thr Phe Met Pro 
Asn Ser Thr Val Tyr Pro Arg Asn Lys Thr Ala His Pro Gly Asn Met Leu He Gin 
35 He Ser Pro Asn He Thr Phe Ser Val Val Tyr Asn Glu He Asn Ser Gly Tyr Ala 
Phe Thr Phe Lys Trp Ser Ala Glu Pro Gly Lys Pro Phe His Pro Pro Thr Ala Val 
. Phe Cys Tyr He Thr Glu Gin 
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1 9 , The nucleic acid according to Claim 1 or 2 encoding the E3 region which 
encodes RL-1, RL-2, RL-3, RL-4, RL-5, and RL- 6 protein of human adenovirus 
Type 41 having a nucleoside sequence which comprises: 

10 20 30 40 50 60 

^w** ******* * * 

GAATTCGCGC CACTCGAAAC CAAATTTTGC TGGAGCAAGC TGCCCTGACC TCCACCCCGC 

CTTAACCGCG -GTGAGCTTTG GTTTAAAACG ACCTCGTTCG ACGGGACTGG AGGTGGGGCG 

70 80 90 100 110 120 

********** ** 

10 GAAGTCAATT GAACCCGCCC AATTGGCCCG CTGCCCAGGT GTATCAGGAA AACCCCGCTC 

CTTCAGTTAA CTTGGGCGGG TTAACCGGGC GAeGGGTGCA CATAGTGCTT TTGGGGCGAG 

130 140 150 160 170 ^180 

ir * * * * * * * * * * * 

CGACCACAGT TCTCCTGCCA CGCGACGCTG AGGCCGAAGT CCAAATGACT AACTCCOGAG 

15 GCTGGTGTCA AGAGGACGGT GCGCTGCGAC TCCGGCTTCA GGTTTACTGA TTGAGGCCTC 

190 200 210 220 230 240 

********** * 

CGCAATTAGC GGGCGGATCC AGACACGTCA GGTTCAGAGG TCGGTCCTCG CCCTACTCTC 

GCGTTAATCG CCCGCCTAGG TCTGTGCAGT CCAAGTCTCC AGCCAGGAGC GGGATGAGAG 

20 250 260 270 280 290 300 

********** * * 

CAGGTCCTAT AAAGAGGCTG ATTATCCGAG GCCGGGGTAT CCAGCTCAAC GACGAAGTGG 

GTCCAGGATA TTTCTCGGAC TAATAGGCTC CGGCCCCATA GGTCGAGTTG CTGCTTCACC 

25 



30 



35 
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310 320 330 340 350 360 

************ 

TGAGCTCCTT AACCGGTCTC CGACCTGACG GAGTTTTCCA GCTTGGAGGT GCCGGCC3CT 
ACTCGAGGAA TTGGCCAGAG GCTGGACTGC CTCAAAAGGT CGAACCTCCA CGGCCGGCGA 
370 380 390 400 410 420 

****T******** 
CCTCCTTCAC-TCCTCGCCAG GCGTACCTGA CACTCCAGAG CTCTTCTTCC -CAGCCTCGCT 
GGAGGAAGTG AGGAGCGGTC CGCATGGACT GTGAGGTCTC GAGAAGAAGG GTCGGAGCGA 
430 440 450 460 470 430 

************ 

CCGGCGGCAT TGGAACCCTC CAGTTTGTGG AGGAGTTTGT ACCCTCCGTT TACTTCAACC 

GGCCGCCGTA ACCTTGGGAG GTCAAACACC TCCTCAAACA TGGGAGGCAA ATGAAGTTGG 
490 500 510 520 i 530 *540 

* * * * * * * * * * * i * 
15 CATTCTCGGG CGCTCCTGGT CTTTACCCAG ACGACTTCAT CCCAAACTAC GACGCGGTGA 

GTAAGAGCCC GCGAGGACCA GAAATGGGTC TGCTGAAGTA GGGTTTGATG CTGCGCCACT 
550 560 570 580 590 600 

* * * « * * * ★ * * .* * 

GCGAATCTGT GGACGGCTAC GACTGAATCC CAATGGTGCG TCCGTGACTG TGTGGCTGCA 
20 CGCTTAGACA CCTGCCGATG CTGACTTAGG GTTACCACGC AGGCACTGAC ACACCGACGT 
610 620 630 640 650 660 

* *********** 

ACATCTACAT CGGCGCCGTA ATCCTTGCTA CTTTGTCTGA AAAGTCTGTG ATTTTTACTT 

TGTAGATGTA GCCGCGGCAT TAGGAACGAT GAAACAGACT TTTCAGACAC TAAAAATGAA 

25 - 670 680 690 700 710 720 

************ 

ACCGCTCCAG CGCTTGGATT ACATGAAGAT CTGTGTTCTT TTTTGTGTGC TAAGTTTAAC 
TGGGGAGGTC GCGAACCTAA TGTACTTCTA GACACAAGAA AAAACACACG ATTCAAATTG 
730 740 750 760 770 780 

20 * .* ** ** ** ** ** 

AAGTAGCCTA AGGACTTCAC CTACAACCGT TGGTTCCTTA CGTCAGCTAC AAGATTCCAC 
TTCATCGGA? TCCTGAAGTG GATGTTGGCA ACCAAGGAAT GCAGTCGATG TTCTAAGGTG 
790 800 810 820 830 84G 

************ 

35 CAAAGGTACA CACCAAACTC TTTATTTTTC TGAGTCTACC ACTTCTATTG CACTTAACTG 
GTTTCCATGT GTGGTTTGAG AAATAAAAAG ACTCAGATGG TGAAGATAAC GTGAATTGAC 
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850 860 870 880 890 900 

TTCTTGTCGT AACCAACTCG TTCAGTGGCG CGCTAACAGA CAATTTTGCA AACTATTTTG 
AAGAACAGCA TTGGTTGAGC AAGTCACCGC GCGATTGTCT GTTAAAACGT TTGATAAAAC 
910 920 930 940 950 960 

GGACGCTCTT "ATTGTTCAAG GAAACAACAG CCTTTGTAAC AACTGTACTG CTACTACTTT 
CCTGCGAGAA TAACAAGTTC CTTTGTTGTC GGAAACATTG TTGACATGAC GATGATGAAA 
970 980 990 1000 1010 1020 

AACTCfTACA CCtCCTTTTG TTCCCGGTCC ATACTTGTGC ATTGCGAGAG GAAGAGGGCC 
TTGAGAATGT GGAGGAAAAC AAGGGCCAGG TATGAACACG TAACCGTGTC CTTCTCCCCG 
1030 1040 1050 1060 1070 1080 

15 TAGCTGCTTT AATCGCTGGA CTTTACAAAA AGAGAACCTA ACCACTACCA CCCTCCTTCC 

ATCGACGAAA TTAGCGACCT GAAATGTTTT TCTCTTGGAT TGGTGATGGT GGGAGGAAGG 

1090 1100 1110 1120 1130 1140 
***** 

CCTTACTACT TATACTTTTT CCCAAAAAAA AATTTACTTT TTGCCCATTA TTGCACTTTT 

2b GGAATGATGA ATATGAAAAA GGGTTTTTTT TTAAATGAAA AACGGGTAAT AACGTGAAAA 
1150 1160 1170 1180 1190 1200 

******** 

GGCCTTTGTC TGTGTTATTA CCGCTAATTA CATTTTAATT TTCAATCTTG ATAATTTTTA 

CCGGAAACAG ACACAATAAT GGCGATTAAT GTAAAATTAA AAGTTAGAAC TATTAAAAAT 
25 * 1210 1220 1230 1240 1250 1260 

* * * ★ "* ★ ★ ★ ★ * * * 

CTAATCATGC TGCTGTTTTT ACTTTGCCTT CTTTTCTGCT CTGCCTATGC CGCCGTGCCA 
GATTAGTACG ACGACAAAAA TGAAACGGAA GAAAAGACGA GACGGATACG GCGGCACGGT 
1270 1280 1290 1300 1310 1320 

20 ************ 
GAAAAAACTC TTAACAACCT CGTTCGGGTG TATGCCTTAG TTGGTACCAA TCTATCCCTT 
jq,.p.pip.p.j.Q;^Q AATTGTTGGA GCAAGCCCAC ATACGGAATC AACCATGGTT AGATAGGGAA 
1330 1340 1350 1360 1370 1380 

* * * * * * * *-* * * * 

35 GATTCTATGA AAACTCCTCA GATTGACGAA CTTACtAGTC TTAGCTGGAT TAAACAGGAA 
CTAAGATACT TTTGAGGAGT CTAACTGCTT GAATGATCAG AATCGACCTA ATTTGTCCTT 
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1 1390 1400 1410 1420 ^1430 1440 

************ 
GACAATCCTA ACAAAAACTT ACAATCATTT TTTTTTATTG GTCAAAAACT CTGTGAAGTT 
CTGTTAGGAT TGTTTTTGAA TGTTAGTAAA AAAAAATAAC CAGTTTTTGA GACACTTCAA 
5 1450 1460 1470 1480 1490 1500 

************ 
ACCAAAGACA AAATCACTGT TTTTAACTAT TATCCGTTGG AATTTTCCTG CGCTAACGTA 
TGGTTTCTGT TTTAGTGACA AAAATTGATA ATAGGCAACC TTAAAAGGAC GCGATTGCAT 
1510 1520 1530 1540 1550 1560 

10* *********** 
ACCTTGTATT TGTATAATCT TAAAACTGAC GATTCTGGCC TCTATAATGG AAAGGCCCAT 
TGGAACATAA ACATATTAGA ATTTTGACTG CTAAGACCGG AGATATTACC TTTCCGGGTA 
1570 1580 1590 1600 1610 1620 

***********? 
15 ACCAAAGAGC TTGAACATAA CACCTATGTT AGGCTTTATG TTATTGACAT TCCTCCGCCT 
TGGTTTCTCG AACTTGTATT GTGGATACAA TCCGAAATAC AATAACTGTA AGGAGGCGGA 
1630 1640 1650 1660 1670 1680 

* *********** 

AAGTGTGACA TTACTTCACG TTACTTAGGC ATACAGGCTA CTGGGGAAGA TTATTGTTTA 
20 TTCACACTGT AATGAAGTGC AATGAATCCG TATGTCCGAT GACCCCTTCT AATAACAAAT 
1690 1700 1710 1720 1730 1740 

* * * * * * * * * * * * 
ATTGAAATCA ATTGCACTAA CTCCAAATAC CCAGCTGTGG TTAAATTTAA TGGCAGGCAA 
TAACTTTAGT TAACGTGATT GAGGTTTATG GGTCGACACC AATTTAAATT ACCGTCCGTT 

25 1750 1760 1770 1780 1790 1800 

************ 
AGCAACTTCT ACCATTATGT TAGCGAAAAC GGAAACAAAG AACTTCCAAA TTTTTATGAA 
TCGTTGAAGA TGGTAATACA ATCGCTTTTG CCTTTGTTTC TTGAAGGTTT AAAAATACTT 
1810 1820 1830 1840 1850 1860 

************ 

ACACACATCA CTGTTAATGG TACCCACAAA AGCTTTCACT TTAATTACCC TTTTAACGAC 
' TGTGTGTAGT GACAATTACC ATGGGTGTTT TCGAAAGTGA AATTAATGGG AAAATTGCTG 
1870 1880 1890 1900 1910 1920 

*,* ***** ****** 

35 CTTTGTCAAA CAACCAGCGC TCTACAATAT AATGACAATG TCCAGGTAGT CCTCATTCTT 
GAAACAGTTT GTTGGTCGCG AGATGTTATA TTACTGTTAC AGGTCCATCA GGAGTAAGAA 
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1 1930 1940 1950 1960 1970 1980 

**★★*****★*« 

CTCATAGTAG TTGGCTTAAT AATAATTTCC GCTAGTTTAA TATTGCTTTA TTGCCACCGC 
GAGTATCATC AACCGAATTA TTATTAAAGG CGATCAAATT ATAACGAAAT AACGGTGGCG 
5 1990 2000 2010 2020 2030 2040 

************ 

AAAAAAATCA AGGCCGAAGT TCAACATCAA CCAGTGCATA TTTGTTTAGA AAAATAAAAT 
TTTTTTTAGT TCCGGCTTCA AGTTGTAGTT GGTCACGTAT AAACAAATCT TTTTATTTTA 
2050 2060 2070 2080 2090 2100 

AAAAAAGAAA AGTCATACCA TTGAGGAGAA GAGGACGAAC AGACAGACGG TTAATAGATG 
2110 2120 2130 2140 2150 2160 

* * * ★ * * ★ * * * 

GCCTCCACCA CCTTCGCCGC AGTCTCCCAC CTTGATACGG ATTGTCTTCC CGCCTTGCTG 

15 CGGAGGTGGT 6GAAGCGGCG TCAGAGGGTG GAACTATGCC TAACAGAAGG GCGGAACGAC 

2170 2180 2190 2200 2210 2220 

* ★ ★ ★ ★ * ★ * * 

ACTTATCTCA TCTTCACCTC TGTTTGCTGC ACTGCCATCT GCAGCATTGC CACTTTTTTT 
TGAATAGAGT AGAAGTGGAG ACAAACGACG TGACGGTAGA CGTCGTAACG GTGAAAAAAA 
20 2230 2240 2250 2260 2270 2280 

* ★***★*★**** 

GTGGCCATTT TCCAAACTGC GGACTACCTA TACGTTAGAG TGGCATACTA TCGTCATCAT 

CACCGGTAAA AGGTTTGACG CCTGATGGAT ATGCAATCTC ACCGTATGAT AGCAGTAGTA 

2290 2300 2310 - 2320 2330 2340 

25 * ♦ ★ * ★ ★ ★ * * * * 

CCCCAATATA GGAACCACGA GGTGGCCGCC CTTCTGTGCC TGTCATGAAA GTTCCTCTTC 
GGGGTTATAT CCTTGGTGCT CCACCGGCGG GAAGACACGG ACAGTACTTT CAAGGAGAAG 
-2350 2360 2370 2380 2390 2400 

30 TCTGTCTTAT CCTCCTTCAC AAAGTCCTGG CCAACTGCCA CCTCCACCGG CCCACCGAGT 
AGACAGAATA GGAGGAAGTG TTTCAGGACC GGTTGACGGT GGAGGTGGCC GGGTGGCTCA 
2410 2420 2430 2440 2450 2460 

TCCTGCGCTG CTACTCAACA GAAACCTCTT CCTTTTGGCT GTACTCCATT ATTTTTATTT 
35 AGGACGCGAC GATGAGTTGT CTTTGGAGAA GGAAAACCGA CATGAGGTAA TAAAAATAAA 
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1 2470 2480 2490 2500 2510 2520 

************ 

TGATTTTCTT TGCCACCTTT TTGGGATTAC AAATTTACGG CTGCCTTCAC CTGGGCTGGA 
ACTAAAAGAA ACGGTGGAAA AACCCTAATG TTTAAATGCC GACGGAAGTG GACCCGACCT 
5 2530 2540 2550 2560 2570 2580 

************ 

TGCATCCTCC-CAACAACCTA CCCAGATTTC CTGGTTTCTT ATTACAGCCC CCGCCGCCCC 
ACGTAGGAGG GTTGTTGGAT GGGTCTAAAG GACCAAAGAA TAATGTCGGG GGCGGCGGGG 
2590 2600 / 2610 2620 2630 2640 

xo ************* 

CACCAGCTCC TGTACAGCGC GCTCCATCAG TTATTAGCTA CTTTCATCTT AACTCTGAAG 

GTGGTCGAGG ACATGTCGCG CGAGGTAGTC AATAATCGAT GAAAGTAGAA TTGAGACTTC 

2650 2660 2670 2680 2690 2700 

* * * * * * * * * * *j' * 

15 ATGTCTGACC AACTAGAAAT CGACGGGCAG CGCACTGAGC AGCTGATCCT TGCTCGGCGA 
TACAGACTGG TTGATCTTTA GCTGCCCGTC GCGTGACTCG TCGACTAGGA ACGAGCCGCT 
2710 2720 2730 2740 2750 2760 

************ 

AAACTCAAAC AACAAAACCA GGAATTGTTC AACCTTCAAG CCTTACACCA ATGCAAAAAG 
20 TTTGAGTTTG TTGTTTTGGT CCTTAACAAG TTGGAAGTTC GGAATGtGGT TACGTTTTTC 
2770 2780 2790 2800 2810 2820 

********** ** 

GGTCTTTTCT GCCTGGTTAA ACAAGCTGAA CTTTGCTATG ATGTAACCCA ACAGGGGCAT 
CCAGAAAAGA CGGACCAATT TGTTCGACTT GAAACGATAC TACATTGGGT TGTCCCCGTA 
25 • -2830 2840 2850 2860 2870 2880 

************ 

GAGCTATCAT ACACTTTAAA CAAGCAAAGA CAGAGCTTTA TGACTATGGT GGGGGTTAAG 
CTCGATAGTA TGTGAAATTT GTTCGTTTCT GTCTCGAAAT ACTGATACCA CCCCCAATTC 
2890 2900 2910 2920 2930 2940 

30 **********x* 

CCCATTAAGG TTACTCAGCA ATCCGGCCCA GTTGAGGGAA GCATTCTTTG TCAGTGCACC 
GGGTAATTCC AATGAGTCGT TAGGCCGGGT CAACTCCCTT CGTAAGAAAC AGTCACGTGG 
2950 2960 2970 2980 2990 3000 

************ 

35 AATTCTGAAT GCATGTACAC TATGGTAAAA ACCCTGTGTG GTCTCAGGGA ACTTCTCCCC 
TTAAGACTTA CGTACATGTG ATACCATTTT TGGGACACAC CAGAGTCCCT TGAAGAGGGG 
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3010 3020 3030 3040 3050 3060 



TTTAATTAAA GTTATCTGAT TAATAAAGCT TACCTTAAAT TTGATATCAG TTGTTTGTCA 
AAATTAATTT CAATAGACTA ATTATTTCGA ATGGAATTTA AACTATAGTC AACAAACAGT 
5 3070 . 3080 3090 3100 3110 3120 

***********'^ 
AGTTTTTCCA GCAGCACCAC CTGCCCTTCC TCCCAACTTT CGTAGGGGAT GTGCCAACGG 
TCAAAAAGGT ■ CGTCGTGGTG GACGGGAAGG AGGGTTGAAA GCATCCCCTA CACGGTTGCC 
3130 3140 3150 3160 3170 3180 

************ 
10 GCAGCAAACT TTCTCCACGT CCTAAAGGGT ATATCGGTGT TCACCTTTTT ACCCTGACCC 
CGTCGTTTGA AAGAGGTGCA GGATTTCCCA TATAGCCACA AGTGGAAAAA TGGGACTGGG 
3190 3200 3210 3220 3230 3240 

************* 
ACGATCTTCA TCTTGCAGAT GAAAAGAACC AGAATTGAAG ACGACTTCAA CCCCGTCTAC 
TGCTAGAAGT AGAACGTCTA CTTTTCTTGG TCTTAACTTC TGCTGAAGTT GGGGCApATG 
15 3250 3260 3275 3280 3290 3300 

******* ***** 
CCCTATGACA CCTTCTCAAC TCCCAGCATC CCCTATGTAG CTCCGCCCTT CGTTTCTTCT 
GGGATACTGT GGAAGAGTTG AGGGTCGTAG GGGATACATC GAGGCGGGAA GCAAAGAAGA 
3310 3320 3330 3340 3350 3360 

♦ .***★*** **** * 

20 GACGGGTTAC AGGAAAAACC CCCAGGAGTT TTAGCACTCA AGTACACTGA CCCCATTACT 
CTGCCCAATG TCCTTTTTGG GGGTCCTCAA AATCGTGAGT TCATGTGACT GGGGTAATGA 

3370 
* * 

ACCAATGCTA AGC 
TGGTTACGAT TCG 

23 ' '-' 

10 . The nucleic acid according to Claim 1 or 2 having a 
nucleotide sequence of Fig. 4 from base 683 to base 1204. 

11. The nucleic acid according to Claim 1 or 2 encoding an 
amino acid sequence for RL-1 which comprises: 

Met Lys He Cys Val Leu Phe Cys Val Leu Ser Leu Thr Ser Ser Leu Arg 
30 Thr Ser Pro Thr Thr Val Gly Ser Leu Arg Gin Leu Gin Asp Ser Thr Lys 
Gly Thr His Gin Thr Leu Tyr Phe Ser Glu Ser Thr Thr Ser lie Ala Leu 
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1 Asn Cys Ser Cys Arg Asn Gin Leu Val Gin Trp Arg Ala Asn Arg Gin Phe 
Cys Lys Leu Phe Trp Asp Ala Leu lie Val Gin Gly Asn Asn Ser Leu Cys 
Asn Asn Cys Thr Ala Thr Thr Leu Thr Leu Thr Pro Pro Phe Val Pro Gly 
Pro Tyr Leu Cys lie Gly Thr Gly Arg Gly Pro Ser Cys Phe Asn Arg Trp 

5 Thr Leu Gin Lys Glu Asn Leu Thr Thr Thr Thr Leu Leu Pro Leu Thr Thr 
Tyr Thr Phe Ser Gin Lys Lys lie Tyr Phe Leu Pro lie lie Ala Leu Leu 
Ala Phe Val -Cys Val lie Thr Ala Asn Tyr lie Leu lie Phe Asn Leu Asp 
Asn Phe Tyr 

12. The nucleic acid according to Claim 1 or 2 having 
10 a nucleotide sequence of Fig. 4 from base 1207 to base 2037. 

13. The nucleic acid according to Claim 1 of 2 encoding 
an amino acid sequence for RL-2 which comprises: 

Met Leu Leu Phe Leu Leu Cys Leu Leu Phe Cys Ser Ala Tyr, Ala Ala Val 
Pro Glu Lys Thr Leu Asn Asn Leu Val Arg Val Tyr Ala Leu Val Gly Thr i 

13 Asn Leu Ser Leu Asp Ser Met Lys Thr Pro Gin lie Asp Glu Leu Thr Ser 
Leu Ser Trp lie Lys Gin Glu Asp Asn Pro Asn Lys Asn Leu Gin Ser Phe 
Phe Phe lie Gly Gin Lys Leu Cys Glu Val Thr Lys Asp Lys lie Thr Val 
Phe Asn Tyr Tyr Pro Leu Glu Phe Ser Cys Ala Asn Val Thr Leu Tyr Leu 
Tyr Asn Leu Lys Thr Asp Asp Ser Gly Leu Tyr Asn Gly Lys Ala His Thr 

20 Lys Glu Leu Glu His Asn Thr Tyr Val Arg Leu Tyr Val lie Asp lie Pro 
Pro Pro Lys Cys Asp lie Thr Ser Arg Tyr Leu Gly lie Gin Ala Thr Gly 
Glu Asp Tyr Cys Leu lie Glu lie Asn Cys Thr Asn Ser Lys Tyr Pro Ala 
Val Val Lys Phe Asn Gly Arg Gin Ser Asn Phe Tyr His Tyr Val Ser Glu 
Asn Gly Asn Lys Glu Leu Pro Asn Phe Tyr Glu Thr His lie Thr Val Asn 

25 Gly Thr- His Lys Ser Phe His Phe Asn Tyr Pro Phe Asn Asp Leu Cys Gin 
Thr Thr Ser Ala Leu Gin Tyr Asn Asp Asn Val Gin Val Val Leu lie Leu 
Leu He Val Val Gly Leu lie He He Ser Ala Ser Leu lie Leu Leu Tyr 
Cys His Arg Lys Lys He Lys Ala Glu Val Gin His Gin Pro Val His He 
Cys Leu Glu Lys 

30 14. The nucleic acid according to Claim 1 or 2 having 

a nucleotide sequence of Fig. 4 from base 1730 to base 1909. 
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1 15. The nucleic acid according to Claim 1 of 2 encoding 

an amino acid sequence for RL-3 which comprises. 

Met Ala Gly Lys Ala Thr Ser Thr lie Met Leu Ala Lys Thr Glu Thr Lys 

Asn Phe Gin He Phe Met Lys His Thr Ser Leu Leu Met Val Pro Thr Lys 
5 Ala Phe Thr Leu He Thr Leu Leu Thr Thr Phe Val Lys Gin Pro Ala Leu 

Tyr Asn He Met Thr Met Ser Arg 

16. The nucleic acid according to Claim 1 or 2 having a 
nucleotide sequence of Fig. 4 from base 2056 to base 2328. 

17. The nucleic acid according to Claim 1 or 2 encoding 
10 an amino acid sequence for RL-4 which comprises: 

Met Val ThE Pro Leu Leu Leu Leu Val Cys Leu Pro He He Tyr Ala Ser 
Thr Thr Phe Ala Ala Val Ser His Leu Asp Thr Asp Cys Leu Pro Ala Leu ^ 
Leu Thr Tyr Leu He Phe Thr Ser Val Cys Cys Thr Ala He Cys Ser He 
Ala Thr Phe Phe Val Ala He Phe Gin Thr Ala Asp Tyr Leu Tyr Val Arg , 
15 Val Ala Tyr Tyr Arg His His Pro Gin Tyr Arg Asn His Glu Val Ala Ala 

Leu Leu Cys Leu Ser 

18. The nucleic acid according to Claim 1 or 2 having a 
nucleotide sequence of Fig. 4 from base 2325 to base 2648. 

19. The nucleic acid according to Claim 1 or 2 encoding 
20 an amino acid sequence for RL-5 which corapises: 

Met Lys Val Pro Leu Leu Cys Leu He Leu Leu His Lys Val Leu Ala Asn 
Cys His Leu His Arg Pro Thr Glu Phe Leu Arg Cys Tyr Ser Thr Glu Thr 
Ser Ser Phe Trp Leu Tyr Ser He He Phe He Leu He Phe Phe Ala Thr 
Phe Leu Gly Leu Gin He Tyr Gly Cys Leu His Leu Gly Trp Met His Pro 
25 Pro Asri- Asn Leu Pro Arg Phe Pro Gly Phe Leu Leu Gin Pro Pro Pro Pro 
Pro Pro Ala Pro Val Gin Arg Ala Pro Ser Val He Ser Tyr Phe His Leu 

Asn Ser Glu Asp Val 

20. The nucleic acid according to Claim 1 or 2 having a 
nucleotide sequence of Fig. 4 from base 2641 to base 3009. 

30 
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1 21. The nucleic acid according to Claim 1 or 2 encoding 

an amino acid sequence for RL-6 which comprises: 

Met 5er Asp Gin Leu Glu lie Asp Gly Gin Arg Thr Glu Gin Leu lie Leu 
Ala Arg Arg Lys Leu Lys Gin Gin A^n Gin Glu Leu Phe Asn Leu Gin Ala 
3 Leu His Gin Cys Lys Lys Gly Leu Phe Cys Leu Val Lys Gin Ala Glu Leu 
Cys Tyr Asp Val Thr Gin Gin Gly His Glu Leu Ser Tyr Thr Leu Asn Lys 
Gin Arg Gln-Ser Phe Met Thr Met Val Gly Val Lys Pro He Lys Val Thr 
Gin Gin Ser Gly Pro Val Glu Gly Ser He Leu Cys Gin Cys Thr Asn Ser 
Glu Cys Met Tyr Thr Met Val Lys Thr Leu Cys Gly Leu Arg Glu Leu Leu 
10 Pro Phe Asn 

22. A replicable expression vector compising the nucleic 
acid of Claim 1 or 2 operably linked to a nucleotide sequence capable 
of effecting expression of a polypeptide encoded by piny one 
of said nucleic acids. ( 
15 23. A recombinant DNA according to Claim 4 having the 

identifying characteristics of the Ad41 long fiber protein sequence 
accorded the £MBL accession number X16583. 

24. A recombinant ONA according to Clciim 7 having the 
identifying characteristics of the Ad41 short fiber protein sequence 

20 accorded the EMBL accession number X17016. 

25. A recombinant DNA according to Claim 9 having the 
identifying characteristics of the Ad41 E3 sequence encoding Ad41 
proteins RL-1 to RL-6 accorded the GenBank accession number M33160. 

26. A recombinant protein of human enteric adenovirus 
25 Type* 41 wherein said protein is long fiber protein, short fiber 

protein, RL-1, RL-2, RL-3, RL-4, RL-5 or RL-6. 
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1 27. The recombinant protein of Claim 26 of long fiber 

protein of human enteric adenovirus Type 41 wherein said protein has 
an amino acid sequence comprising: 

Met Lys Arg Ala Arg Leu Glu Asp Asp Phe Asn Pro 
5 Val Tyr Pro Tyr Glu His Tyr Asn Pro Leu Asp lie Pro Phe lie Thr Pro 
Pro Phe Ala Ser Ser Asn Gly Leu Gin Glu Lys Pro Pro Gly Val Leu Ser 
Leu Ciy Thr -Gly Leu Asn lie Asp Glu Asn Gly Asp Leu Ser Ser Asp Ala 
Ser Val Glu Val Ser Ala Pro He Thr Lys Thr Asn Lys He Val Gly Leu 
Asn Tyr Thr Lys Pro Leu Ala Leu Arg Ser Asn AJ.a Leu Thr Leu Ser Tyr 
10 Asn Ala Pro Leu Asn Val Val Asn Asn Asn Leu Ala Leu Asn He Ser Gin 
Pro Val Thr Val Asn Ala Asn Asn Glu L^u Ser Leu Leu He Asp Ala Pro 
Leu Asn Ala Asp Thr Gly Thr Leu Arg Leu Gin Ser Ala Ala Pro Leu Gly^ 
Leu Val Asp Lys Thr Leu Lys Val Leu Phe Ser Ser Pro Lev, Tyr Leu Asp 
Asn Asn Phe Leu Thr Leu Ala He Glu Arg Pro Leu Ala Leu Ser Ser SeiJ 
15 Arg Ala Val Thr Leu Lys Tyr Ser Pro Pro Leu Lys He Glu Asn Glu Asn 
Leu Thr Leu Ser Thr Gly Gly Pro Phe Thr Val Ser Gly Gly Asn Leu Asn 
Leu Thr Thr Ser Ala Pro Leu Ser Val Gin Asn Asn Ser Leu Ser Leu Val 
He Thr Ser Pro Leu Lys Val He Asn Ser Met Leu Ala Val Gly Val Asn 
Pro Pro Phe Thr He Thr Asp Ser Gly Leu Ala Met Asp Leu Gly Asp Gly 
20 Leu Ala Leu Gly Gly Ser Lys Leu He He Asn Leu Gly Pro Gly Leu Gin 
Met Ser Asn Gly Ala He Thr Leu Ala Leu Asp Ala Ala Leu Pro Leu Gin 
Tyr Arg Asp Asn Gin Leu Gin Leu Arg He Gly Ser Thr Ser Gly Leu He 
Met Ser Gly Val Thr Gin Thr Leu Asn Val Asn Ala Asn Thr Gly Lys Gly 
Leu Ala Val Glu Asn Asn Ser Leu Val Val Lys Leu Gly Asn Gly Leu Arg 
25 Phe Asp Ser Trp Gly Ser He Thr Val Ser Pro Thr Thr Thr Thr Pro Thr 
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Ala Thr Phe Tyr Glu Ser 
Leu Asp Ala Lys Val Trp Leu Val Leu Val Lys Cys Asn Gly Met Val Asn 
Gly Thr He Ser He Lys Ala Gin Lys Gly He Leu Leu Arg Pro Thr Ala 
Ser Phe He Ser Phe Val Met Tyr Phe Tyr Ser Asp Gly Thr Trp Arg Lys 
30 Asn Tyr Pro Val Phe Asp Asn Glu Gly He Leu Ala Asn Ser Ala Thr Trp 
Gly Tyr Arg Gin Gly Gin Ser Ala Asn Thr Asn Val Ser Asn Ala Val Glu 
Phe Met Pro Ser Ser Lys Arg Tyr Pro Asn Gin Lys Gly Ser Glu Val Gin 
Asn Met Ala Leu Thr Tyr Thr Phe Leu Gin Gly Asp Pro Asn Met Ala He 
Ser Phe Gin Ser He Tyr Asn His Ala Leu Glu Gly Tyr Ser Leu Lys Phe 
35 Thr Trp Arg Val Arg Asn Asn Glu Arg Phe Asp He Pro Cys Cys Ser Phe 
Ser Tyr Val Thr Glu Gin 
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28 . The recombinant protein of Claim 26 of the short fiber 
protein of human enteric adenovirus Type 41 wherein said protein has 
an amino acid sequence comprising: 

Met Lys Arg Thr Arg lie Glu Asp Asp Phe Asn Pro Val Tyr Pro Tyr Asp 
Thr Phe Ser Thr Pro Ser lie Pro Tyr Val Ala Pro Pro Phe Val Ser Ser 
Asp Gly Leu Gin Glu Lys Pro Pro Gly Val Leu Ala Leu Lys Tyr Thr Asp 
Pro lie Thr, Thr Asn Ala Lys His Glu Leu Thr Leu Lys Leu Gly Ser Asn 
lie Thr Leu Glu Asn Gly Leu Leu Ser Ala Thr Val Pro Thr Val Ser Pro 
Pro Leu Thr Asn Ser Asn Asn Ser Leu Gly Leu Ala Thr Ser Ala Pro lie 
Ala Val Ser Ala Asn Ser Leu Thr Leu Ala Thr Ala Ala Pro Leu Thr Val 
Ser Asn Asn Gin Leu Ser lie Asn Ala Gly Arg Gly Leu Val lie Thr Asn 
Asn Ala Leu Thr Val Asn Pro Thr Gly Ala Leu Gly Phe Asn Asn Thr Gly 
Ala Leu Gin Leu Asn Ala Ala Gly Gly Met Arg Val Asp Gly Ala Asn Leu 
lie Leu His Val Ala Tyr Pro Phe Glu Ala lie Asn Gin Leu Thr Leu Arg 
15 Leu Glu Asn Gly Leu Glu Val Thr Ser Gly Gly Lys Leu Asn Val Lys Leu 
Gly Ser Gly Leu Gin Phe Asp Ser Asn Gly Arg lie Ala lie Ser Asn Ser 
Asn Arg Thr Arg Ser Val Pro Ser Leu Thr Thr lie Trp Ser lie Ser Pro 
Thr Pro Asn Cys Ser lie Tyr Glu Thr Gin Asp Ala Asn Leu Phe Leu Cys 
Leu Thr Lys Asn Gly Ala His Val Leu Gly Thr lie Thr lie Lys Gly Leu 
Lys Gly Ala Leu Arg Glu Met His Asp Asn Ala Leu Ser Leu Lys Leu Pro 
Phe Asp Asn Gin Gly Asn Leu Leu Asn Cys Ala Leu Glu Ser Ser Thr Trp 
Arg Tyr Gin Glu Thr Asn Ala Val Ala Ser Asn Ala Leu Thr Phe Met Pro 
Asn Ser Thr Val Tyr Pro Arg Asn Lys Thr Ala His Pro Gly Asn Met Leu 
He Gin He Ser Pro Asn He Thr Phe Ser Val Val Tyr Asn Glu He Asn 
Ser Cly, Tyr Ala Phe Thr Phe Lys Trp Ser Ala Glu Pro Gly Lys Pro Phe 
His Pro Pro Thr Ala Val Phe Cys Tyr He Thr Glu Gin 
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1 29 . The recombinant protein of Claim 26 of E3 RL-1 protein 

of human adenovirus Type 41 wherein said protein has an amino acid 
sequence comprising: 

Met Lys He Cys Val Leu Phe Cys Val Leu Ser Leu Thr Ser Ser Leu Arg 
5 Thr Ser Pro Thr Thr Val Gly Ser Leu Arg Gin Leu Gin Asp Ser Thr Lys 
Gly Thr His Gin Thr Leu Tyr Phe Ser Glu Ser Thr Thr Ser lie Ala Leu 
Asn Cys Ser Cys Arg Asn Gin Leu Val Gin Trp Arg Ala Asn Arg Gin Phe 
Cys Lys Leu Phe Trp Asp Ala Leu He Val Gin Gly Asn Asn Ser Leu Cys 
Asn Asn Cys Thr Ala Thr Thr Leu T?hr Leu Thr Pro Pro Phe Val Pro Gly 
10 Pro Tyr Leu Cys He Gly Thr Gly Arg Gly Pro Ser Cys Phe Asn Arg Trp 
Thr Leu Gin Lys Glu Asn Leu Thr Thr Thr Thr Leu Leu Pro Leu Thr Thr 
Tyr Thr Phe Ser Gin Lys Lys He Tyr Phe Leu Pro He He Ala Leu Leu ^ 
Ala Phe Val Cys Val He Thr Ala Asn Tyr He Leu He Phe, Asn Leu Asp 
Asn Phe Tyr * 

15 

30 . The recombinant protein of Claim 26 of E3 RL-2 protein 
of human adenovirus Type 41 wherein said protein has an amino acid 
sequence comprising: 

Met Leu Leu Phe Leu Leu Cys Leu Leu Phe Cys Ser Ala Tyr Ala Ala Val 
20 Pro Glu Lys Thr Leu Asn Asn Leu Val Arg Val Tyr Leu Val Gly Thr , 
Asn Leu Ser Leu Asp Ser Met Lys Thr Pro Gin He Asp Glu Leu Thr Ser 
Leu Ser Trp He Lys Gin Glu Asp Asn Pro Asn Lys Asn Leu Gin Ser Phe 
Phe Phe He Gly Gin Lys Leu Cys Glu Val Thr Lys Asp Lys He Thr Val 
Phe Asn Tyr Tyr Pro Leu Glu Phe Ser Cys Ala Asn Val Thr Leu Tyr Leu 
25 Tyr Asn Leu Lys Thr Asp Asp Ser Gly Leu Tyr Asn Gly Lys Ala His Thr 
Lys Glu Leu Glu His Asn Thr Tyr Val Arg Leu Tyr Val He Asp He Pro 
Pro Pro Lys Cys Asp He Thr Ser Arg Tyr Leu Gly He Gin Ala Thr Gly 
Glu Asp Tyr Cys Leu He Glu He Asn Cys Thr Asn Ser Lys Tyr Pro Ala 
Val Val Lys Phe Asn Gly Arg Gin Ser Asn Phe Tyr His Tyr Val Ser Glu 
30 Asn Gly Asn Lys Glu Leu Pro Asn Phe Tyr Glu Thr His He Thr Val Asn 
Gly Thr His Lys Ser Phe His Phe Asn Tyr Pro Phe Asn Asp Leu Cys Gin 
Thr Thr Ser Ala Leu Gin Tyr Asn Asp Asn Val Gin Val Val Leu He Leu 
Leu He Val Val Gly Leu He He He Ser Ala Ser Leu He Leu Leu Tyr 
Cys His Arg Lys Lys He Lys Ala Glu Val Gin His Gin Pro Val His He 
35 Cys Leu Glu Lys 
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1 31. The recombinant protein of Claim 26 of E3 Rl-3 

protein of hxunan adenovirus Type 41 wherein said protein has an amino 
acid sequence con^rising: 

Met Ala Gly Lys Ala Thr Ser Thr He Met Leu Ala Lys Thr Glu Thr Lys 
5 Asn Phe Gin He Phe Met Lys His Thr Ser Leu Leu Met Val Pro Thr Lys 
Ala Phe Thr Leu He Thr Leu Leu Thr Thr Phe Val Lys Gin Pro Ala Leu 
Tyr Asn • He Wet Thr Met Ser Arg 

32. The recombinant protein of Claim 26 of E3 Rl-4 protein 
of human adenovirus Type 41 wherein said 
10 protein has an amino acid sequence comprising: 

Met Vai Thr Pro Leu Leu Leu Leu Val Cys Leu Pro He lie Tyr Ala Ser 
Thr Thr Phe Ala Ala Val Ser His Leu Asp Thr Asp Cys Leu Pro Ala Leu 
Leu Thr Tyr Leu He Phe Thr Ser Val Cys Cys Thr Ala Ile^ Cys Ser He 
Ala Thr Phe Phe Val Ala He Phe Gin Thr Ala Asp Tyr Leu Tyr Val Arg ; 
15 Val Ala Tyr Tyr Arg His His Pro Gin Tyr Arg Asn His Glu Val Ala Ala 
Leu Leu Cys Leu Ser 

33. The recombinant protein of Claim 26 of E3 RL-5 protein 
of human adenovirus Type 41 wherein said protein has an amino acid 
sequence comprising: 
20 Met Lys Val Pro Leu Leu Cys Leu He Leu Leu His Lys Val Leu Ala Asn 
Cys His Leu His Arg Pro Thr Glu Phe Leu Arg Cys Tyr Ser Thr Glu Thr 
Ser Ser Phe Trp Leu Tyr Ser He He Phe He Leu He Phe Phe Ala Thr 
Phe Leu Gly Leu Gin He Tyr Gly Cys Leu His Leu Gly Trp Met His Pro 
Pro Asn Asn Leu Pro Arg Phe Pro Gly Phe Leu Leu Gin Pro Pro Pro Pro 
25 Pro Pro Ala Pro Vai Gin Arg Ala Pro Ser Val He Ser Tyr Phe His Leu 
Asn Ser Glu Asp Val 
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1 34. The recontbinant protein of Claim 26 of £3 RL-6 protein 

of human adenovirus Type 41 wherein said protein has an amino acid 
sequence comprising : 

Met Ser Asp Gin Leu Glu lie Asp Gly Gin Arg Thr Glu Gin Leu lie Leu 
5 Ala Arg Arg Lys Leu Lys Gin Gin Asn Gin Glu Leu Phe Asn Leu Gin Ala 
Leu His Gin Cys Lys Lys Gly Leu Phe Cys Leu Val Lys Gin Ala Glu Leu 
Cys Tyr Asp -Val Thr Gin Gin Gly His Glu Leu Ser Tyr Thr Leu Asn Lys 
Gin Arg Gin Ser Phe Met Thr Met Val Gly Val Lys Pro He Lys Val Thr 
Gin Gin Ser Gly Pro Val Glu Gly Ser He Leu Cys Gin Cys Thr Asn Ser 
10 Glu Cys Met Tyr Thr Met Val Lys Thr Leu Cys Gly Leu Arg Glu Leu Leu 
Pro Phe Asn 

35. A polypeptide encoded by the nucleic acid of any o^e 
of Claims 1-21. 

36. A polypeptide comprising an antigenic fragment * 
15 of human adenovirus Type 41 long fiber protein, short fiber protein, 

Rl-1 protein, RL-2 protein, RL-3 protein, KL-4 protein, RL-5 protein 
or RL-S protein . 

37. Antibodies against a long fiber protein of hiaman 
adenovirus Type 41, a short fiber protein, RL-1 protein, 

20 RL-2 protein, RL-^3 protein, RL-4 protein, RL-5 protein or 
RL-6 protein. 

38. Antibodies against the polypeptides of Claims 35 or 36. 

39. The antibodies according to Claim 38 which 
25 are specific to the tail, shaft or Icnob region of the Ad41 

long fiber protein or short fiber protein, 

40. The antibodies according to any one of Claims 37 zo 
39 wherein said antibodies are monoclonal or polyclonal. 

41. A vaccine for immunization against a human 
30 adenovirus comprising the administration of an effective 

amount of at least one of Ad41 long fiber protein, short fiber protein, 
RL-1, RL-2, RL-3, Rl-4, RL-5 or RL-6 and/or active fragments thereof 
in association with a conventional vaccine carrier. 



wo 91/08310 



-68- 



PCr/US90/06887 



1 42. A vaccine for imnunlzatlon against a htunan 

adenovirus cortprising the administration of a mixture of 
inactivated Ad4I and at least one of recombinant proteins 
as described in any one of Claims 26 to 34 or active 

5 fragments thereof in association with a conventional vaccine carrier. 

43- The vaccine according to Claim 41 or 42 wherein the 
hximan adenovirus is Ad41 or Ad40. 

44. The vaccine according to any one of Claims 41 
to 43 wherein the dosage effective range is about 0.001- 

10 100 mg antigen/kg body. 

45. A host organism or cell transformed by the 
nucleic acid of any one of Claims 1 to 21. 

a 

46. A host organism or cell according to Claim 45 ( 
13 wherein the host is yeast or bacterium. 

47. A method o£ detecting or diagnosing human 
adenovirus comprising contacting serum, tissue, or tissue 
extracts of an individual to be tested with an antibody 
against Ad41 long fiber protein, short fiber protein, RL-1 

20 protein, RL-2 protein, RL-3 protein, RL-4 protein, RL-5 

protein or RL-6 protein or an active fragment thereof, for 
a time and under conditions necessary to form an antibody- 
antigen conplex, and detecting any resultant antibody- 
ant igen complex . 
25 • 48 • A method for detecting human adenovirus 

Type 41, humsm adenovirus Ad40 or any adenovirus antigenically 
or structureJ.ly similar to human Ad41 in infected cells 
in a. sanple comprising lysing said cells, fixing the DNA of the 
infected calls and detecting the DNA containing said long 
30 fiber protein gene, short fiber protein gene or E3 gene by a specific 
probe nucleic acid wherein said probe nucleic acid is DNA, cDNA, 
recombinant DNA or RNA. 
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1 49. A compartmentalized kit for detection of 

human adenovirus type 41 comprising at least one first 
container adapted to contain an antibody having specificity 
for said Ad41 long fiber protein, short fiber protein or 

5 E3 proteins, .RL-1 to RL-6 and at least one second container 
adapted to contain a reporter molecule capable of detecting 
the antibody of said first container. 

50. The kit of Claim 49 wherein the reporter 
molecule is a radioisotope, an enzyme, a fluorescent 
20 molecule, a chemiluminescent molecule or a bioluminescent 
molecule . 



15. 



20 
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10 



20 



30 



40 



50 



6C 



CCCGGGCAAC ATGCTCATCC AAATCTCGCC TAACATCACC TTCAGTGTCG TCTACAACGA 
GGGCCCGTTG TACGAGTAGG TTTAGAGCGG ATTGTAGTGG AAGTCACAGC AGATGTTGCT 



5 Smal 



70 



80 



90 



100 



110 



120 



GATAAACAGT GGGTATGCTT TTACTTTTAA ATGGTCAGCC GAACCGGGA.^ AACCTTTTCA 
10 CTATTTGTCA CCCATACGAA AATGAAAATT TACCAGTCGG CTTGGCCCTT TTGGAAAAGT 



15 



130 

* * 

CCCACCTACC 
GGGTGGATGG 
190 



140 



150 



160 



170 



180 



GCTGTATTTT GCTACATAAC TGAACAATAA AATCATTGCA GGCACAATCT 
CGACATAAAA CGATGTATTG ACTTGTTATT TTAGTAACGT CCGTGTTAGA 



200 



210 



220 



230 



TCGCATTTCT TTTTTTCCAG ATG AAA CGA GCC AGA CTT GAA GAT GAC TTC AAC CCC 
AGCGTAAAGA AAAAAAGGTC TAG TTT GC7 CGG TCT GAA CTT CTA CTG AAG TTG GGG 
20 Met Lys Arg Ala Arg Leu Glu Asp Asp Phe Asn Pro 

60.6 KD FIBER PROTEIN 
240 250 260 270 280 



25 



GTC TAG CCT TAG GAA CAC TAG AAT CCC CTT GAC ATC CCA TTT ATT ACA CCC 
CAG ATG GGA ATG CTT GTG ATG TTA GGG GAA CTG TAG GGT AAA TAA TGT GGG 
Val Tyr Pro Tyr Glu His Tyr Asn Pro Leu Asp lie Pro Phe lie Thr Pro 

60.6 KD FIBER PROTEIN 
230 300 310 320 330 



30 



CCG TTT GCC TCC TCC AAC GGC TTG CAA GAA AAA CCA CCG GGA GTC CTC AGC 
GGC AAA CGG AGG AGG TTG CCG AAC GIT CTT TTT GGT GGC CCT CAG GAG TCG 
Pro Phe Ala Ser Ser Asn Gly Leu Gin Giu Lys Pro Pro Gly Val Leu Ser 

60.6 KD FIBER PROTEIN 
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340 350 360 370 380 



CTG AAA TAG ACT GAT CCA CTT ACA ACC AAA AAC GGG GCT TTA ACC TTA AAA 
5 GAC TTT ATG TGA CTA GGT GAA TGT TGG TTT TTG CCC CGA AAT TGG AAT TTT 
Leu Lys Tyr Thr Asp Pro Leu Thr Thr Lys Asn Gly Ala Leu Thr Leu Lys 

60.6 KD FIBER PROTEIN 
390 400 410 420 430 440 

******** *i 
10 CTG GGC ACG GGA CTA AAC ATT GAT GAA AAT GGA GAT CTT TCT TCA GAT GCT 
GAC CGG TGC CGT GAT TTG TAA CTA CTT TTA CCT CTA GAA AGA AGT CTA CGA 
Leu Gly Thr Gly Leu Asn He Asp Glu Asn Gly Asp Leu Ser Ser Asp Ala 

60.6 KD FIBER PROTEIN 
450 460 470 480 490 

3^5* ** * ** * ** 

AGC GTG GAA GTT AGC GCC CCT ATT ACT AAA ACC AAC AAA ATC GTA GGT TTA 
TCG CAC CTT CAA TCG CGG GGA TAA TGA TTT TGG TTG TTT TAG CAT CCA AAT 
Ser Val Glu Val Ser Ala Pro He" Thr Lys Thr Asn Lys He Val Gly Leu 

60.6 KD FIBER PROTEIN 
20 500 510 520 530 540 



AAT TAG ACT AAA CCT CTC GCC CTG CGA AGT AAC GCG CTC ACT CTT TCT TAC 
TTA ATG TGA TTT GGA GAG CGG GAC GCT TCA TTG CGC GAG TGA GAA AGA ATG 
25 Asn Tyr- Thr Lys Pro Leu Ala Leu Arg Ser Asn Ala Leu Thr Leu Ser Tyr 

60/6 KD FIBER PROTEIN 
550 560 570" 580 590 



30 AAC GCA CCC TTA AAC GTA GTA AAT AAC AAT TTA GCT TTA AAT ATC TCA CAA 

TTG CGT GGG AAT TTG CAT CAT TTA TTG TTA AAT CGA AAT TTA TAG AGT GTT 

Asn Ala Pro Leu Asn Val Val Asn Asn Asn Leu Ala Leu Asn 'lie Ser Gin 

60*. 6 KD FIBER PROTEIN 
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600 610 620 630 640 



30 



***** * 



CCT GTC ACT GTT AAT GCA AAC AAC GAA CTT TCT CTC TTA ATA GAC GCC CCA 
5 GGA CAG TGA CAA TTA CGT TTG TTG CTT GAA AGA GAG AAT TAT CTG CGG GGT 
Pro Val Thr Val Asn Ala Asn Asn Glu Leu Ser Leu Leu lie As? Ala Pro 

. 60.6 KD FIBER PROTEIN 
650 660 670 680 690 

* 

10 CTT AAT GCT GAC ACG GGC ACT CTT CGC CTT CAA AGT GCT GCA CCT CTT GGA 
GAA TTA CGA CTG TGC CCG TGA GAA GCG GftA GTT TCA CGA CGT GGA GAA CCT 
Leu Asn Ala Asp Thr Gly Thr Leu Arg Leu Gin Ser Ala Ala Pro Leu Gly 

60.6 KD FIBER PROTEIN 
700 710 720 730 740 • 

15********** 

CTA GTG GAC AAA ACA CTA AAA GTT TTG TTT TCT AGC CCC CTC TAT CTA GAT 
GAT CAC CTG TTT TGT GAT TTT CAA AAC AAA AGA TCG GGG GAG ATA GAT CTA 
Leu Val Asp Lys Thr Leu Lys Val Leu Phe Ser Ser Pro I^u Tyr Leu Asp 

20 60.6 KD FIBER PROTEIN 

750 760 770 780 790 

* • ♦ * * * * * * * 

AAT AAC TTT CTT ACA CTA GCC ATT GAA CGC CCG CTA GCT CTA TCC AGT AGC 

TTA TTG AAA GAA TGT GAT CGG TAA CTT GCG GGC GAT CGA GAT AGG TCA TCG 

25 Asn ASH Phe Leu Thr Leu Ala He Glu Arg Pro Leu Ala Leu Ser Ser Ser 

60.6 KD FIBER PROTEIN 
800 810 820 830 840 



AGA GCA GTG ACC CTT AAG TAT TCA CCA CCT TTA AAA ATA GAA AAC GAA AAC 
TCT CGT CAC TGG GAA TTC ATA AGT GGT GGA AAT TTT TAT CTT TTG CTT TTG 
Arg Ala Val Thr Leu Lys Tyr Ser Pro Pro Leu Lys He Glu Asn Glu Asn 

60.6 KD FIBER PROTEIN 
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1 850 860 870 880 890 

********** 

TTA ACC CTA AGC ACA GGC GGG CCT TTT ACT GTA AGC GGG GGA AAT CTA AAC 
AAT TGG GAT TCG TGT CCG CCC GGA AAA TGA CAT TCG CCC CCT TTA GAT TTG 
5 Leu Thr Leu Ser Thr Gly Gly Pro Phe Thr Val Ser Gly Gly Asn Leu Asn 

60,6 KD FIBER PROTEIN 
900 .910 920 930 940 950 

TTA ACA ACA TCG GCA CCT CTC TCC GTG CAA AAC AAC TCT CTC TCC TTA GTC 
10 AAT TGT TGT AGC CGT GGA GAG AGG CAC GTT TTG TTG AGA GAG AGG AAT GAG 
Leu Thr Thr Ser Ala Pro Leu Ser Val Gin Asn Asn Ser Leu Ser Leu Val 

60.6 KD FI3ER PROTEIN 

960 970 930 990. 1000 

* *'* ★ ** * *.* 

i 

15 ATT ACT TCT CCT TTA AAA GTT ATT AAT TCT ATG TTA GCC GTT GGG GTT AAC 
TAA TGA AGA GGA AAT TTT CAA TAA TTA AGA TAG AAT GGG CAA CCC CAA TTG 
He Thr Ser Pro Leu Lys Val He Asn Ser Met Leu Ala Val Gly Val Asn 

60.6 KD FIBER PROTEIN 
1010 1020 1030 1040 1050 

2Q * * * ★ * * * -K ★ * 

CCG CCT TTT ACC ATC ACT GAC TCT GGA TTA GCT ATG GAC TTA GGA GAC GGT 
GGC GGA AAA TGG TAG TGA CTG AGA CCT AAT CGA TAC CTG AAT CCT CTG CCA 
Pro Pro Phe Thr He Thr Asp Ser Gly Leu Ala Met Asp Leu Gly Asp Gly 

25 . ._ . 60 . 6 KD FIBER PROTEIN 

Figure 1 - Cont'd 
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1 1060 1070 1080 1090 1100 

CTT GCA CTA GGT GGC TCT AAG TTA ATA ATC AAT CTT GGT CCA GGT TTA CAA 
GAA CGT GAT CCA CCG AGA TTC AAT TAT TAG TTA GAA CCA GGT CCA AAT GTT 
5 Leu Ala Leu Gly Gly Ser Lys Leu Tie lie Asn Leu Gly Pro Gly Leu Glr. 

60.6 KD FIBER PROTEIN 
1110- 1120 1130 1140 1150 

* * * «.* * * * * 

ATG TCT AAT GGA GCT ATT ACT TTA GCA CTA GAT GCA 3CG CTG CCT TTG CAA 
10 TAC AGA TTA CCT CGA TAA TGA AAT CGT GAT CTA CGT CGC GAC GGA AAC GTT 
Met Ser Asn Gly Ala He Thr Leu Ala Leu Asp Ala Ala Leu Pro Leu Gin 

60.6 KD FIBER PROTEIN 
1160 1170 1180 1190 ;L200 

* * * * * * * * 

15 TAT AGA GAC AAC CAA CTT CAA CTC AGA ATT GGC TCA ACA TCT GGC TTA ATT 
ATA TCT CTG TTG GTT GAA GTT GAG TCT TAA CCG AGT TGT AGA CCG AAT TAA 
Tyr Arg Asp Asn Gin Leu Gin Leu Arg He Gly Ser Thr Ser Gly Leu He 

60.6 KD FIBER PROTEIN 
1210 1220 1230 1240 1250 

20********* 

ATG AGC GGA GTA ACA CAA ACA TTA AAC GTC AAT GCC AAT ACC GGC AAA GGT 
TAC TCG CCT CAT TGT GTT TGT AAT TTG CAG TTA CGG TTA TGG CCG TTT CCA 
Met Ser Gly Val Thr Gin Thr Leu Asn Vai Asn Ala Asn Thr Gly Lys Gly 

60.6 KD FIBER PROTEIN 
25 • 1260 1270 1280 1290 1300 

* * * * *•* * ♦ ir 

CTT GCT GTT GAA AAC AAC TCA CTA GTT GTT AAG CTT GGG AAC GGT CTT CGC 
GAA CGA CAA CTT TTG TTG AGT GAT CAA CAA TTC GAA CCC TTG CCA GAA GCG 
Leu Ala Val Glu Asn Asn Ser Leu Val Val Lys Leu Gly Asn Gly Leu Arg 
30 60.6 KD FIBER PROTEIN 

1310 1320 1330 1340 1350 

* * * * * * ★ « ** 

TTT GAT AGC TGG GGA AGC ATA ACT GTC TCG CCT ACT ACC ACT ACC CCT ACC 
AAA CTA TCG ACC CCT TCG TAT TGA CAG AGC GGA TGA TGG TGA TGG GGA TGG 
35 Phe Asp Ser Trp Gly Ser He Thr Val Ser Pro Thr Thr Thr Thr Pro Thr 

60.6 KD FIBER PROTEIN 

Figure 1 - Cont'd 
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1 1560 1370 1380 1390 1400 

********** 



ACC CTA TGG ACC ACC GCA GAC CCA TCA CCT AAC GCC ACT TTT TAT GAA TC^ 

TGG GAT ACC TGG TGG CGT CTG GGT AGT GGA TTG CGG TGA AAA ATA CTT AGT 

5 Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Ala Thr Phe Tyr Glu Ser 

60.6 KD FIBER PROTEIN 

1410 1420 1430 1440 1450 1460 

******* **** 

CTA GAC GCC AAA GTG TGG CTA GTT TTA GTA AAA TGC AAC GGC ATG GTT AAC 
10 GAT CTG CGG TTT CAC ACC GAT CAA AAT CAT TTT ACG TTG CCG TAC CAA TTG 
Leu Asp Ala Lys Vai T-rp Leu Val Leu Val Lys Cys Asn Gly Met Val Asn 

60.6 KD FIBER PROTEIN 
1470 1480 1490 1500 . 1510 

* * * * * A * * * *j" * 

15 GGG ACC ATA TCC ATT AAA GCT CAG AAA GGC ATT TTA CTT AGA CCT ACA GCT 
CCC TGG TAT AGG TAA TTT CGA GTC TTT CCG TAA AAT GAA TCT GGA TGT CGA 
Gly Thr He Ser He Lys Ala Gin Lys Gly He Leu Leu Arg Pro Thr Ala 

60.6 KD FIBER PROTEIN 
1520 1530 1540 1550 1560 

20 ************ 
AGT TIT ATT TCC TTT GTC ATG TAT TTC TAC AGC GAT GGA ACA TGG AGA AAA 
TCA AAA TAA AGG AAA CAG TAC ATA AAG ATG TCG CTA CCT TGT ACC TCT TTT 
Ser Phe lie Ser Phe Val Met Tyr Phe Tyr Ser Asp Gly Thr Trp Arg Lys 

60.6 KD FIBER PROTEIN 

25 * ■- * 1570 1580 1590 1600 1610 

********* 

AAC TAT CCC GTG TTT GAC AAC GAA GGG ATA CTA GCA AAC AGT GCC ACG TGG 
TTQ ATA GGG CAC AAA CTG TTG CTT CCC TAT GAT CGT TTG TCA CGG TGC ACC 
Asn Tyr Pro Val Phe Asp Asn Glu Gly He Leu Ala Asn Ser Ala Thr Trp 
30 . 60.6 KD FIBER PROTEIN 

1620 1630 1640 1650 1660 

* * * * * « * * * * 

GGT TAT CGA CAA GGA CAG TCT GCC A^C ACT AAC GTT TCT AAT GCT GTA GAA 
CCA ATA GCT GTT CCT GTC AGA CGG TTG TGA TTG CAA AGA TTA CGA CAT CTT 
35 Gly Tyr Arg Gin Gly Gin Ser Ala Asn Thr Asn Val Ser Asn Ala Val Glu 

60.5 KD FIBER PROTEIN 
Figure 1 - Conr' d 
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1 1670 1680 1690 1700 1710 

TTT ATG CCT AGC TCT AAA AGA TAT CCC AAT CAA AAA GGT TCT GAA GTT CAG 
AAA TAG GGA TCG AGA TTT TCT ATA GGG TTA GTT TTT CCA AGA CTT CAA GTC 
5 Phe Met Pro Ser Ser Lys Arg Tyr Pro Asn Gin Lys Gly Ser Glu Val Gin 

60,6 KD FIBER PROTEIN 
.1720- 1730 1740 1750 1760 

★ *★**★** 

AAC ATG GCT CTT ACC TAG ACT TTT TTG CAA GGT GAT CCT AAC ATG GCC ATA 
10 TTG TAC CGA GAA TGG ATG TGA AAA AAC GTT CCA CTA GGA TTG TAC CGG TAT 
Asn Met Ala Leu Thr Tyr Thr Phe Leu Gin Gly Asp Pro Asn Met Ala lie 

60.6 KD FIBER PROTEIN 



15 



1770 1780 1790 1800 1810 / 

********* 

TCC TTT CAG AGT ATT TAT AAT CAT GCA TTA GAA GGG TAC TCA TTA AAA TTT 
AGG AAA GTC TCA TAA ATA TTA GTA CGT AAT CTT CCG ATG AGT AAT TTT AAA 
Ser Phe Gin Ser lie Tyr Asn His Ala Leu Glu Gly Tyr Ser Leu Lys Phe 

60.6 KD FIBER PROTEIN 

20 1820 1830 1840 1850 1860 

********* 

ACC TGG CGC GTT CGA AAT AAT GAA CGT TTT GAC ATC CCC TGC TGC TCA TTT 
TGG ACC GCG CAA GCT TTA TTA CTT GCA AAA CTG TAG GGG ACG ACG AGT AAA 
Thr Trp Arg Val Arg Asn Asn Glu Arg Phe Asp lie Pro Cys Cys Ser Phe 
25 . 60-6 KD FIBER PROTEIN 

1870 1880 1890 1900 1910 1920 

******* ** 

TCT TAT GTA ACA GAA CAA TAA A ATATTGTTGT TTTTGTTTTT ATAACTTTAT 
AGA ATA CAT TGT CTT GTT ATT T TATAACAACA AAAACAAAAA TATTGAAATA 
30 Ser Tyr Val Thr Glu Gin End 

60.6 KD FIBER PROTEIN 

1930 
* * ★ 

TGATACTTTT ACAGAATTC 
35 ACTATGAAAA TGTCTTAAG 

EcoRI 
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1 10 20 30 40 50 60 

********* * «« 

GATATCAGTT GTTTGTCAAG TTTTTCCAGC AGCACCACCT GCCCTTCCTC CCAACTTTCG 

CTATAGTCAA CAAACAGTTC AAAAAGGTCG TCGTGGTGGA CGGGAAGGAG GGTTGAAAGC 

5 70 80 90 100 110 120 

*★★**★****«♦ 

TAGGGGATGT GCCAACGGGC AGCAAACTTT CTCCACGTCC TAAAGGGTAT ATCGGTGTTC 
ATCCCCTACA CGGTTGCCCG TCGTTTGAAA GAGGTGCAGG ATTTCCCATA TAGCCACAAG 
130 140 130 160 170 180 

J^Q ★*★****★***♦ 
ACCTTTTTAC CCTGACCCAC GATCTTCATC TTGCAGAISA AAAGAACCAG AATTGAAGAC 
TGGAAAAATG GGACTGGGTG CTAGAAGTAG AACGTCTACT TTTCTTGGTC TTAACTTCTG 
190 200 210 220 . 230 * 240 

* * * * * * * * * * ★^•-* 

15 GACTTCAACC CCGTCTACCC CTATGACACC TTCTCAACTC CCAGCATCCC CTATGTAGCT 
CTGAAGTTGG GGCAGATGGG GATACTGTGG AAGAGTTGAG GGTCGTAGGG GATACATCGA 
250 260 270 280 290 300 

************ 

CCGCCCTTCG TTTCTTCTGA CGGGTTACAG GAAAAACCCC CAGGAGTTTT AGCACTCAAG 

20 GGCGGGAAGC AAAGAAGACT GCCCAATGTC CTTTTTGGGG GTCCTCAAAA TCGTGAGTTC 

310 320 330 340 350 360 

* * * * * * * * * * * ■* 

TACACTGACC CCATTACTAC CAATGCTAAG CATGAGCTTA CTTTAAAACT TGGAAGCAAC 
ATGTGACTGG GGTAATGATG GTTACGATTC GTACTCGAAT GAAATTTTGA ACCTTCGTTG 
25 370 380 390 400 410 420 

************ 

ATAACTTTAG AAAATGGGTT ACTTTCGGCC ACAGTTCCCA CTGTTTCTCC TCCCCTTACA 
TATTGAAATC TTTTACCCAA TGAAAGCCGG TGTCAAGGGT GACAAAGAGG AGGGGAATGT 
430 440 430 460 470 480 

************ 

AACAGTAACA ACTCCCTGGG TTTAGCCACA TCCGCTCCCA TAGCTGTATC AGCTAACTCT 
TTGTCATTGT TGAGGGACCC AAATCGGTGT AGGCGAGGGT ATCGACATAG TCGATTGAGA 



30 
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10 



30 



490 



500 510 520 530 540 



CTCACATTGG CCACCGCCGC ACCACTGACA GTAAGCAACA ACCAGCTTAG TATTAACGCG 
GAGTGTAACC GGTGGC3GCG TGGTGACTGT CATTCGTTGT TGGTCGAATC ATAATTGCGC 
550 560 570 580 590 6C0 

* 

GGCAGAGGrr-TAGTrATAAC TAACAATGCC TTAACAGTTA ATCCTACCGG AGCGCTAGGT 
CCGTCTCCAA ATCAATATTG ATTGTTACGG AATTGTCAAT TAGGATGGCC TCGCGATCCA 
610 620 630 640 650 650 

****** 
TTCAATAACA CAGGAGCTTT ACAATTAAAT GCTGCAGGAG GAATGAGAGT GGACGGTGCC 
AAGTTATTGT GTCCTCGAAA TGTTAATTTA CGACGTCCTC CTTACTCTCA CCTGCCACGG 
670 680 690 700 . 710 * 720 

* « * * * * * * * *i* 

15 AACTTAATTC TTCATGTAGC ATATCCCTTT GAAGCAATCA ACCAGCTAAC ACTGCGATTA 
TTGAATTAAG AAGTACATCG TATAGGGAAA CTTCGTTAGT TGGTCGATTG TGACGCTAAT 
730 740 750 760 770 780 



* 



********* 



GAAAACGGGT TAGAAGTAAC CAGCGGAGGA AAGCTTAACG TTAAGTTGGG ATCAGGCCTC 
20 CTTTTGCCCA ATCTTCATTG GTCGCCTCCT TTCGAATTGC AATTCAACCC TAGTCCGGAG 
790 800 810 820 830 840 

********** 
CAATTTGACA GTAACGGACG CATTGCTATT AGTAATAGCA ACCGAACTCG AAGTGTACCA 
GTTAAACTGT CATTGCCTGC GTAACGATAA TCATTATCGT TGGCTTGAGC TTCACATGGT 
25 • • 850 860 870 880 890 900 

*********" 
TCCCTCACTA CCATTTGGTC TATCTCGCCT ACGCCTAACT GCTCCATTTA TGAAACCCAA 
AGGGAGTGAT GGTAAACCAG ATAGAGCGGA TGCGGATTGA CGAGGTAAAT ACTTTGGGTT 
910 920 930 940 950 960 

**********"" 
GATGCAAACC TATTTCTTTG TCTAACTAAA AACGGAGCTC ACGTATTAGG TACTATAACA 
CTACGTTTGG ATAAAGAAAC AGATTGATTT TTGCCTCGAG TGCATAATCC ATGATATTGT 
970 980 990 1000 ICIO 1020 



* 



* 



35 ATCAAAGGTC TTAAAGGAGC ACTGCGGGAA ATGCACGATA ACGCTCTATC TTTAAAACTT 

TAGTTTCCAG AATTTCCTCG TGACGCCCTT TACGTGCTAT TGCGAGATAG AAATTTTGAA 



Figura 2 - Cont'd 
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10 



35 



1030 1040 1050 1060 1070 1080 

************ 
CCCTTTGACA ATCAGGGAAA TTTACTTAAC TGTGCCTTGG AATCATCCAC CTGGCGTTAC 
GGGAAACTGT TAGTCCCTTT AAATGAATTG ACACGGAACC TTAGTAGGTG GACCGCAATG 

1090 1100 1110 1120 1130 1140 

CAGGAAACCA - ACGCAGTGGC CTCTAATGCC TTAACATTTA TGCCCAACAG TACAGTGTAT 

GTCCTTTGGT TGCGTCACCG GAGATTACGG AATTGTAAAT ACGGGTTGTC ATGTCACATA 

1150 1160 . 1170 1180 1190 1200 
************ 

CCACGAAACA AAACCGCTCA CCCGGGCAAC ATGCTCATCC AAATCTCGCC TAACATCACC 

GGTGCTTTGT TTTGGCGAGT GGGCCCGTTG TACGAGTAGG TTTAGAGCGG ATTGTAGTGG 

1210 1220 1230 1240 ;1250 1260 



* * * 



* 



i 

15 TTCAGTGTCG TCTACAACGA GATAAACAGT GGGTATGCTT TTACTTTTAA ATGGTCAGCC 
AAGTCACAGC AGATGTTGCT CTATTTGTCA CCCATACGAA TU^TGAAAATT TACCAGTCGG 
1270 1280 1290 1300 1310 1320 

*tt********** 
GAACCGGGAA AACCTTTTCA CCCACCTACC GCTGTATTTT GCTACATAAC TGAACAAIM 
20 CTTGGCCCTT TTGGAAAAGT GGGTGGATGG XGACATAAAA CGATGTATTG ACTTGTTATT 
1330 1340 1350 ' 1360 1370 1380 

^^*********** 

AATCATTGCA GGCACAATCT TCGCATTTCT TTTTTTCCAG ATGAAACGAG CCAGACTTGA 
TTAGTAACGT CCGTGTTAGA AGCGTAAAGA AAAAAAGGTC TACTTTGCTC GGTCTGAACT 
25 . 1390 1400 

* * w * 

AGATGACTTC AACCCCGTCT AC 
TCTACTGAAG TTGGGGCAGA TG 
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1 Met Lys Arg Thr Arg lie Giu Asp 
Thr Phe Ser Thr Pro Ser He ?ro 
Asp Giy Leu Gin Giu Lys Pro Pro 
Pro lie Thr Thr Asn Aia Lys His 
5 lie Thr Leu Giu Asn Giy Leu Leu 
Pro Leu Thr Asn Ser Asn Asn Ser 
Aia Vai Ser Aia Asn Ser Leu Thr 
Ser Asn Asn Gin Leu Ser lie Asn 
Asn Aia Leu Thr Vai Asn Pro Thr 
10 Aia Leu Gin Leu Asn Aia Aia Giy 
lie Leu His Vai Aia Tyr Pro Phe 
Leu Giu Asn Giy Leu Giu Vai Thr 
Giy Ser Giy Leu Gin Phe Asp. Ser 
Asn Arg Thr Arg Ser Vai Pro Ser 
15 Thr Pre Asn Cys Ser lie Tyr Giu 
Leu Thr Lys Asn Giy Aia His Vai 
Lys Giy Aia Leu Arg Giu Met His 
Phe Asp Asn Gin Giy Asn Leu Leu 
Arg Tyr Gin Giu Thr Asn Aia Vai 
20 Asn Ser Thr Vai Tyr Pro Arg Asn 
lie Gin lie Ser Pro Asn lie Thr 
Ser Giy Tyr Aia Phe Thr Phe Lys 
His Pro Pro Thr Aia Vai Phe Cys 

25 



Asp Phe Asn Pro Vai Tyr Pro Tyr Asp 
Tyr Vai Aia Pro Pro Phe Vai Ser Ser 
Giy Vai Leu Aia Leu Lys Tyr Thr Asp 
Giu Leu Thr Leu Lys Leu Giy Ser Asn 
Ser Aia Thr Vai Pro Thr Vai Ser Pro 
Leu Giy Leu Aia Thr Ser Aia Pro lie 
Leu Aia Thr Ala Ala Pro Leu Thr Vai 
Aia Giy Arg Giy Leu Vai lie Thr Asn 
Giy Aia Leu Giy Phe Asn Asn Thr Giy 
Giy Met Arg Vai Asp Giy Aia Asn Leu 
Giu Ala lie Asn Gin Leu Thr Leu Arg 
Ser Giy Giy Lys Leu Asn Vai Lys Leu ^ 
Asn Giy Arg lie Aia He, Ser Asn Ser 
Leu Thr Thr lie Trp Ser lie Ser PrO( 
Thr Gin Asp Ala Asn Leu Phe Leu Cys 
Leu Giy Thr He Thr lie Lys Giy Leu 
Asp Asn Ala Leu Ser Leu Lys Leu Pro 
Asn Cys Ala Leu Giu Ser Ser Thr Trp 
Ala Ser Asn Ala Leu Thr Phe Met Pro 
Lys Thr Ala His Pro Giy Asn Met Leu 
Phe Ser Vai Vai Tyr Asn Giu He Asn 
Trp Ser Aia Giu Pro Giy Lys Pro Phe 
Tyr lie Thr Giu Gin 
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10 



15 



20 



25 



30 



35 



10 20 30 40 50 60 

GAATTCGCGC CACTCGAAAC CAAATTTTGC TGGAGCAAGC TGCCCTGACC TCCACCCCGC 
CTTAAGCGCG GTGAGCTTTG GTTTAAAACG ACCTCGTTCG ACGGGACTGG AGGTGGGGCG 

70 80 90 100 110 120 

* * * ★ * * * * * 
GAAGTCAATT GAACCCGCCC AATTGGCCCG CTGCCCAGGT GTATCAGGAA AACCCCGCTC 
CTTCAGTTAA CTTGGGCGGG TTAACCGGGC GACGGGTCCA CATAGTCCTT TTGGGGCGAG 

130 140 150 160 170 180 

* * * ♦ * * * * 

CGACCACAGT TCTCCTGCCA CGCGACGCTG AGGCCGAAGT CCAAATGACT AACTCCGGAG 

GCTGGTGTCA AGAGGACGGT GCGCTGCGAC TCCGGCTTCA GGTTTACTGA TTGAGGCSCTC 

190 200 210 220 230 240 

* ********* 
CGCAATTAGC GGGCGGATCC AGACACGTCA GGTTCAGAGG TCGGTCCTCG CCCTACTCTC 
GCGTTAATCG CCCGCCTAGG TCTGTGCAGT CCAAGTCTCC AGCCAGGAGC GGGATGAGAG 

250 260 270 280 290 300 

********** 

CAGGTCCTAT AAAGAGGCTG ATTATCCGAG GCCGGGGTAT CCAGCTCAAC GACGAAGTGG 
GTCCAGGATA TTTCTCCGAC TAATAGGCTC CGGCCCCATA GGTCGAGTTG CTGCTTCACC 

310 320 330 340 350 360 

*** ******* 

TGAGCTCCTT AACCGGTCTC CGACCTGACG GAGTTTTCCA GCTTGGAGGT GCCGGCCGCT 
ACTCGAGGAA TTGGCCAGAG GCTGGACTGC CTCAAAAGGT CGAACCTCCA CGGCCGGCGA 

370 380 390 400 410 420 

********** 

CCTCCTTCAC TCCTCGCCAG GCGTACCTGA CACTCCAGAG CTCTTCTTCC CAGCCTCGCT 
GGAGGAAGTG AGGAGCGGTC CGCATGGACT GTGAGGTCTC GAGAAGAAGG GTCGGAGCGA 

Figure 4 
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^ 430 440 450 460 470 4S0 

CC3GCGGCAT TGGAACCCTC CAGTTTGTGG AGGAGTTTGT ACCCTCCGTT TACTTCAACC 

GGCCGCCGTA ACCTTG3GAG GTCAAACACC TCCTCAAACA TGGGAGGCLA ATGAAGTTGG 

5 

490 500 510 520 530 540 



^, * * * « • * * 



CATTCTCGGG CGCTCCTGGT CTTTACCCAG ACGACTTCAT CCCAAACTAC GACGCGGTGA 

GTAAGAGCCC GCGAGGACCA GAAATGGGTC TGCTGAAGTA GGGTTTGATG CTGCGCCACT 

10 

550 560 570 580 590 600 

* . * * * * • * * . * 

GCGAATCTGT GGACGGCTAC GACTGAATCC CAATGGTGCG TCCGTGACTG TGTGGCTGCA 

CGCTTAGACA CCTGCCGATG CTGACTTAGG GTTACCACGC AGGCACTGAC ACACC^ACGT 

15 

610 620 630 640 650 660 



******** 



ACATCTACAT CGGCGCCCTA ATCCTTGCTA CTTTGTCTGA AAAGTCTGTG ATTTTTACTT 

TGTAGATCTA GCCGCGGCAT TAGGAACGAT GAAACAGACT TTTCAGACAC TAAAAATGAA 

20 

670 680 690 700 710 720 

ACCGCTCCAG CGCTTGGATT ACATGAAGAT CTGTGTTCTT TTTTGTGTGC TAAGTTTAAC 
TGGCGAGGTC GCGAACCTAA TGTACTTCTA GACACAAGAA AAAACACACG ATTCi>AATTG 

25 

730 740 750 760 770 780 

* 

AAGTAGCCTA AGGACTTOVC CTACAACCGT TGGTTCCTTA CGTCAGCTAC AAGATTCCAC 
TTCATCGGAT TCCTGAAGTG GATGTTGGCA ACCAAGGAAT GCAGTCGATG TTCTAAGGTG 

30 

790 800 810 820 830 840 

* ***-***'* 
CAAAGGTACA CACCAAACTC TTTATTTTTC TGAGTCTACC ACnCTATTG CACTTAACTG 
GTTTCCATGT GTGGTTTGAG AAATAAAAAG ACTCAGATGG TGAAGATAAC GTGAATTGAC 

35 

Figure 4- Cont'd 
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850 860 870 880 890 900 

********** 
TTCTTGTCGT AACCAACTCG TTCAGTGGCG CGCTAACAGA CAATTTTGCA AACTATTTTG 
AAGAACAGCA TTGGTTGAGC AAGTCACCGC GCGATTGTCT GTTAAAACGT TTGATAAAAC 

910 920 930 940 950 950 

*.*-* * * * * •» * * 

GGACGCTCTT ATTGTTCAAG GAAACAACAG CCTTTGTAAC AACTGTACTG CTACTACTTT 

CCTGCGAGAA TAACAAGTTC CTTTGTTGTC GGAAACATTG TTGACATGAC GATGATGAAA 

570 980 990 1000 1010 1020 

*♦*««**•** 
AACTCTTACA CCTCCTTTTG TTCCCGG7CC ATACTTGTGC ATTGCgACAG GAAGAGGGCC 
TTGAGAATGT GGAGGAAAAC AAGGGCCAGG TATGAACACG TAACCGTGTC CTTCTaCCGG 

1030 1040 1050 106 1070 lOSO 

********** 
TAGCTGCTTT AATCGCTGGA CTTTACAAAA AGAGAACCTA ACCACTACCA CCCTCCTTCC 
ATCGACGAAA TTAGCGACCT GAAATGTTTT TCTCTTGGAT TGGTGATGGT GGGAGGAAGG 

1090 1100 1110 1120 1130 1140 

* '* * « * * * * * * 

CCTTACTACT TATACTTTTT CCCAAAAAAA AATTTACTTT TTGCCCATTA TTGCACTTTT 

GGAATGATGA ATATGAAAAA GGGTTTTTTT TTAAATGAAA AACGGGTAAT AACGTGAAAA 

1150 1160 1170 1180 1190 1200 

********** 
GGCCTTTGTC TGTGTTATTA CCGCTAATTA CATTTTAATT TTCAATCTTG ATAATTTTTA 
CCGGAAACAG ACACAATAAT GGGGATTAAT GTAAAATTAA AAGTTAGAAC TATTAAAAAT 

1210 1220 1230 1240 1250 1260 

********** 
CTAATCATGC TGCTGTTTTT ACTTTGCCTT CTTTTCTGCT CTGCCTATGC CGCCGTGCCA 
. GATTAGTACG ACGACAAAAA TGAAACGGAA GAAAAGACGA GACGGATACG GCGGCACGGT 

Figure 4 - Con'd 
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WO 91/08310 . , 

1,15/27 



1270 



1280 1290 1300 1310 1320 



GAAAAAACTC TTAACAACCT CGTTCGGGTG TATGCCTTAG TTGGTACCAA TCTATCCCTT 

CTTTTTTGAG AATTGTTGGA GCAAGCCCAC ATACGGAATC AACCATGGTT AGATAGGGAA 

5 

1330 1340 1350 1360 1370 1380 

« . * * * * * * 

GATTCTATGA AAACTCCTCA GATTGACGAA CTTACTAGTC TTAGCTGGAT TAAACAGGAA 

CTAAGATACT TTTGAGGAGT CTAACTGCTT GAATGATCAG AATCGACCTA ATTTGTCCTT 

10 1390 1400 1410 1420 1430 1440 

* * * * * 

GACAATCCTA ACAAAAACTT ACAATCATTT TTTTTTATTG GTCAAAAACT CTGTGAAGTT 

CTGTTAGGAT TGTTTrTGAA TGTTAGTAAA AAAAAATAAC CAGTTTTTGA GACACTTCAA 



15 1450 1460 1470 1480 1490 1500 

ACCAAAGACA AAATCACTGT TTTTAACTAT TATCCGTTGG AATTTTCCTG CGCTAACGTA 

TGGTTTCTGT TTTAGTGACA AAAATTGATA ATAGGCAACC TTAAAAGGAC GCGATTGCAT 

20 1510 1520 1530 1540 1550 1560 
********* 

ACCTTGTATI TGTATAATCT TAAAACTGAC GATTCTGGCC TCTATAATGG AAAGGCCCAT 

TGGAACATAA ACATATTASA ATTTTGACTG CTAAGACCGG AGATATTACC TTTCCGGGTA 

25 ■ -■ 1570 1580 1590 1600 1610 1620 



* 



******* 



ACCAAAGAGC TTGAACATAA CACCTATGTT AGGCTTTATG TTATTGACAT TCCTCCGCCT 
TGGTTTCTCG AACTTGTATT GTGGATACAA TCCGAAATAC AATAACTGTA AGGAGGCGGA 



30 1630 



1640 1650 1660 1670 1680 



* 

AAGTGTGACA TTACTTCACG TTACTTAGGC ATACAGGCTA CTGGGGAAGA TTATTGTTTA 

TTCACACTGT AATGAAGTGC AATGAATCCG TATGTCCGAT GACCCCTTCT AATAACAAAT 

25 Figure 4 - Cont'd 
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1690 1700 1710 1720 1730 1740 

* * * * * . * 

ATTGAAATCA ATTGCACTAA CTCCAAATAC CCAGCTGTGG TTAAATTTAA TGGCAGGCAA 

TAACTTTAGT TAACGTGATT GAGGTTTATG GGTCGACACC AATTTAAATT ACCGTCCGTT 

1750 1760 1770 1780 1790 ISOO 

* * * w * * * 



* 



AGCAACTTCT ACCATTATGT TAGCGAAAAC GGAAACAAAG AACTTCCAAA TTTTTATGAA 
TCGTTGAAGA TGGTAATACA ATCGCTTTTG CCTTTGTTIC TTGAAGGTTT AAAAATACTT 



10 



1310 



1820 1830 1840 1850 I860 



ACACACATCA CTGTTAATGG TACCCACAAA AGCTTTCACT TTAATTACCC TTTTAACGAC 
TGTGTGTAGT GACAATTACC ATGGGTGTTT TCGAAAGTGA AATTAATGGG AAAATTeCTG 



15 



1870 



20 



1880 1890 1900 1910 1920 

* w ******* * 

CTTTGTCAAA CAACCAGCGC TCTACAATAT AATGACAATG TCCAGGTAGT CCTCATTCTT 
GAAACAGTTT GTTGGTCGCG AGATGTTATA TTACTGTTAC AGGTCCATCA GGAGTAAGAA 

1930 1940 1950 I960 1970 1980 

********** 
CTCATAGTAG TTGGCTTAAT AATAATTTCC GCTAGTTTAA TATTGCTTTA TTGCCACC3C 
GAGTATCATC AACCGAATTA TTATTAAAGG CGATCAAATT ATAACGAAAT AACGGTGGCG 



25 



1990 



30 



35 



2000 2010 2020 2030 2040 

********* * 
AAAAAAATCA AGGCCGAAGT TCAACATCAA CCAGTGCATA TTTGTTTAGA AAAATAAAAT 
^.jTTTTTAGT TCCGGCTTCA AGTTGTAGTT GGTCACGTAT AAACAAATCT TTTTATTTTA 

2050 2060 2070 2080 2090 2100 

******* 
TTTTTTCTTT TCAGTATGGT AACTCCTCTT CTCCTGCTTG TCTGTCTGCC. AATTATCTAC 
AAAAAAGAAA AGTCATACCA TPGAGGAGAA GAGGACGAAC AGACAGACGG TTAATAGATG 

Figure 4 - Conc'd 
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2110 2120 2130 2140 2130 216C 



5 



10 



15 



20 



* * * 



* * * * 



25 



30 



35 



GCCTCCACCA CCTTCGCCGC AGTCTCCCAC CTTGATACGG ATTGTCTTCC CGCCTT3CTG 
CGGAGGTGGT GGAAGCGGCG TCAGAGGGTG GAACTATGCC TAACAGAAGG GCGGAACGAC 

2170 2180 2190 2200 2210 222C 

* * * * * * 

ACTTATCTCA TCTTCACCTC TGTTTGCTGC ACTGCCATCT GCAGCATTGC CACTTTTTTT 

TGAATAGAGT AGAAGTGQAG ACAAACGACG TGACGGTAGA CGTCGTAACG GTGAAAAAAA 

2230 2240 2250 2260 2270 2280 

******" 
GTGGCCATTT TCCAAACTGC GGACTACCTA TACGTTAGAG TGGCATACTA TCGTCATCAT 
CACCGGTAAA AGGTTTGACG CCTGATGGAT ATGCAATCTC ACCGTATGAT AGCAGTSVGTA 

2290 2300 2310 2320 2330 2340 

********** 
CCCCAATATA GGAACCACGA GGTGGCCGCC CTTCTGTGCC TGTCATGAAA GTrCCTCTTC 
GGGGTTATAT CCTTGGTGCT CCACCGGCGG GAAGACAOSG ACAGTACTTT CAAGGAGAAG 

2350 2360 2370 2380 2390 2400 

* * ♦ « * * * * * * 
TCTGTCTTAT CCTCCTTCAC AAAGTCCTGG CCAACTGCCA CCTCCACCGG CCCACCGAGT 
AGACAGAATA GGAGGAAGTG TTTCAGGACC GGTTGACGGT GGAGGTGGCC GGGTGGCTCA 

2410 2420 2430 2440 2450 2460 

******** 
TCCTGCGCTG CTACTCAACA GAAACCTCTT CCTTTTGGCT GTACTCCATT ATTTTTATTT 
AGGACGCGAC GATGAGTTGT CTTTGGAGAA GGAAAACCGA CATGAGGTAA TAAAAATAAA 

2470 2480 2490 2500 2510 2520 

* * * * * *.*. * * * 
TGATTTTCTT TGCCACCTTT TTGGGATTAC AAATTTACGG CTGCCTTCAC CTGGGCTGGA 
ACTAAAAGAA ACGGTGGAAA AACCCTAATG TTTAAATGCC GACGGAAGTG GACCCGACCT 

Fiqure 4- Cont'd 
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5 



10 



15 



20 



25 



2530 



2540 2550 2560 2570 258C 



« * 



* * 



30 



35 



TGCATCCTCC CAACAACCTA CCCAGATTTC CTGGTTTCTT ATTACAGCCC CCGCCGCCCC 

ACGTAGGAGG GTTGTTGGAT GGGTCTAAAG GACCAAAGAA TAATGTCGGG GGCGGCGGGG 

2590 2600 2610 2520 2630 264C 

« *~* * * * * ■» * 

CACCAGCTCC TGTACAGCGC GCTCCATCAG TTATTAGCTA CTTTCATCTT AACTCTGAAG 

GTGGTCGAGG ACATGTCGCG CGAGGTAGTC AATAATCGAT GAAAGTAGAA TTGAGACTTC 

2650 2660 2670 2680 2690 2700 
,****♦•*♦• 

ATGTCTGACC AACTAGAAAT CGACGGGCAG CGCACTGAGC AGCTGATCCT TGCTCGGCGA 

TACAGACTGG TTGATCTTTA GCTGCCCGTC GCGTGACTCG TCGACTAGGA ACGAGCOGCT 

2710 2720 2730 2740 2750 2760 
********** 

AAACTCAAAC AACAAAACCA GGAA.TTGTTC AACCTTCAAG CCTTACACCA ATGCAAAAAG 

TTTGAGTTTG TTGTTTTGGT CCTTAACAAG TTGGAAGTTC GGAATGTGGT TACGTTTTTC 

2770 2780 2790 2800 2810 2820 



★ 



GGTCTTTTCT GCCTGGTTAA ACAAGCTGAA CTTTGCTATG ATGTAACCCA ACAGGGGCAT 

CCAGAAAAGA CGGACCAATT TGTTCGACTT GAAACGATAC TACATTGGGT TGTCCCCGTA 

2830 2840 2850 2860 2870 2880 

* ★ * * * * *.* * * 

GAGCTATCAT ACACTTTAAA CAAGCAAAGA CAGAGCTTTA TGACTATGGT GGGGGTTAAG 

CTCGATAGTA TGTGAAATTT GTTCGTTTCT GTCTCGAAAT ACTGATACCA CCCCCAATTC 

2890 2900 2910 2920 2930 2940 

CCCATTAAGG TTACTCAGCA ATCCGGCCCA GTTGAGGGAA GCATTCTTTG TCAGTGCACC 

GGGTAATTCC AATGAGTCGT TAGGCCGGGT CAACTCCCTT CGTAAGAAAC AGTCACGTGG 

Figure 4 - Cont'd 
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1 2950 2960 2970 2980 2990 3000 



AATTCTGAAT GCATGTACAC TATGGTAAAA ACCCTGTGTG GTCTCAGGGA ACTTCTCCCC 

TTAAGACTTA CGTACATGTG ATACCATTTT TGGGACACAC CAGAGTCCCT TGAAGAGGGG 

5 

3010 3020 3030 3040 3050 3060 

* . * * * * * * 

TTTAATTAAA GTTATCTGAT TAATAAAGCT TACCTTAAAT TTGATATCAG TTGTTTGTCA 
AAATTAATTT CAATAGACTA ATTATTTCGA ATGGAATTTA AACTATAGTC AACAA^.CAGT 

10 

3070 3080 3090 3100 3110 3120 

******* 
AGrrTTTCCA GCAGCACCAC CTGCCCTTCC TCCCAACTTT CGTAGGGGAT GTGCCAACGG 
TCAAAAAGGT CGTCGTGGTG GACGGGAAGG AGGGTTGAAA GCATCCCCTA CACGGTTGqC 

15 

3130 3140 3150 3160 3170 3180 

GCAGCAAACT TTCTCCACGT CCTAAAGGGT ATATCGGTGT TCACCTTTTT ACCCTGACCC 

CGTCGTTTGA AAGAGGTQCA GGATTTCCCA TATAGCCACA AGTGGAAAAA TGGGACTGGG 

20 

3190 3200 3210 3220 3230 3240 

ACGATCTTCA TCTTGCAGAT GAAAAGAACC AGAATTGAAG ACGACTTCAA CCCCGTCTAC 
TGCTAGAAGT AGAACGTCTA CTTTTCTTGG TCTTAACTTC TGCTGAAGTT GGGGCAGATG 

25 

3250 3260 327T) 3280 3290 3300 

* •' * 

CCCTATGACA CCTTCTCAAC TCCCAGCATC CCCTATGTAG CTCCGCCCTT CGTTTCTTCT 
GGGATACTGT GGAAGAGTTG AGGGTCGTAG GGGATACATC GAGGCGGGAA GCAAAGAAGA 

30 

Figure 4 - Cont'd 



35 
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1 3310 3320 3330 3340 3350 3360 

,*****»•** 
GACGGGTTAC AGGAAAAACC CCCAGGAGTT TTAGCACTCA AGTACACTGA CCCCATTACT 
CTGCCCAATG TCCTTTTTGG GGGTCCTCAA AATCGTGAGT TCATGTGACT GGGGTAATGA 

5 

3370 
* * .. 
ACCAATGCTA AGC 
TGGTTACGAT TCG 

10 

Figure - Cont'd 
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1 Met Lys He Cys Vai Leu Phe Cys 

Thr Ser Pro Thr Thr Val Gly Ser 

Gly Thr His Gin Thr Leu Tyr Phe 

Asn Cys Ser Cys Arg Asn Gin Leu 

5 Cys Lys Leu Phe Trp Asp Ala Leu 

Asn Asn Cys Thr Ala Thr Thr Leu 

Pro Tyr Leu Cys He Gly Thr Gly 

Thr Leu Gin Lys Glu Asn Leu Thr 

Tyr Thr Phe Ser Gin Lys Lys He 

10 Ala Phe Val Cys Val He Thr Ala 

Asn Phe Tyr 



15 



Val Leu Ser Leu Thr Ser Ser Leu Arg 
Leu Arg Gin Leu Gin Asp Ser Thr Lys 
Ser Glu Ser Thr Thr Ser He Ala Leu 
Val Gin Trp Arg Ala Asn Arg Gin Phe 
He Val Gin Gly Asn Asn Ser Leu Cys 
Thr Leu Thr Pro Pro Phe Val Pro Gly 
Arg Gly Pro Ser Cys Phe Asn Arg Trp 
Thr Thr Thr Leu Leu Pro Leu Thr Thr 
Tyr Phe Leu Pro He He Ala Leu Leu 
Asn Tyr He Leu He Phe Asn Leu Asp 
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Figure 5 
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1 Met Ala Gly Lys Ala Thr Ser Thr 

Asn Phe Gin lie Phe Met Lys His 

Ala Phe Thr Leu He Thr Leu Leu 

Tyr Asn He Met Thr Met Ser Arg 



He Met Leu Ala Lys Thr Glu Thr Lys 
Thr Ser Leu Leu Met Val Pro Thr Lys 
Thr Thr Phe Val Lys Gin Pro Aia Leu 
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Met Val Thr Pro Leu Leu Leu Leu Val Cys Leu Pro lie He Tyr Ala Ser 

Thr Thr Phe Ala Ala Val Ser His Leu Asp Thr Asp Cys Leu Pro Ala Leu 

Leu Thr Tyr Leu He Phe Thr Ser Val Cys Cys Thr Ala He Cys Ser He 

Ala Thr Phe Phe Val Ala He Phe Gin Thr Ala Asp Tyr Leu Tyr Val Arg 

val Ala Tyr Tyr Arg His His Pro Gin Tyr Arg Asn His Glu Val Ala Ala 

Leu Leu Cys Leu Ser 
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1 Met Lys Val Pro Leu Leu Cys Leu 

Cys His Leu His Arg Pro Thr Glu 

Ser Ser Phe Trp Leu Tyr Ser lie 

Phe Leu Gly Leu Gin lie Tyr Gly 

5 Pro Asn Asn Leu Pro Arg Phe Pro 

Pro Pro Ala Pro Val Gin Arg Ala 

Asn Ser Glu -Asp Val 

10 
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Phe Leu Arg Cys Tyr Ser Thr- Glu Thr 
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Cys Tyr Asp Val Thr Gin Gin Gly His Glu Leu Ser Tyr Thr Leu Asn Lys 

Gin Arg Gin Ser Phe Met Thr Met Val Gly Val Lys Pro He Lys Val Thr 

Gin Gin Ser Gly Pro Val Glu Gly Ser He Leu Cys Gin Cys Thr Asn Ser 
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